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ABSTRACT 


Exosomes are a type of extracellular vesicle that carry cargo, including proteins and 
nucleic acids, that reports on the conditions of their cell of origin. Exosomes can be isolated 
from a range of biological fluids, including blood and urine, and their use in clinical diagnostic 
applications has been extensively researched. Breath is an excellent source of biomarkers. 
Exosomes collected from exhaled breath concentrate offer untapped potential in the field. For 
example, exosomes secreted from the cells of the lung carry biomarkers of lung health and so 
report selectively on lung health. However, whilst several publications report the presence of 
breath exosomes, and some discuss potential applications, there is no published work on 
developing standardized procedures for isolating high purity exosomes for biomarker 
identification, nor have breath exosomes been extensively characterized. Studying exosomes 
can potentially increase the specificity of diagnosis with an ultimate goal to translate early 
diagnosis into early treatment. 

In this study we report results of optimized methodology for isolation of exosomes from 
human breath and undertaken phenotyping of breath exosomes by comparing them to cell- 
derived exosomes. Determining the benchmark proteome of breath exosomes became the 
major achievement of this study. The proteome profile of breath-derived exosomes was 
identified for the first time from exosomes obtained from the breath of 55 healthy volunteers 
indicating that the protein cargo of cell- and breath-derived exosomes are somewhat similar. 
The results of this study suggest that the exosomal breath proteome may be used in future 
studies identifying new biomarkers of lung dysfunction, including lung cancer. The possibility 
of identifying putative lung cancer biomarkers was shown using exosomes isolated from the 
lung cancer cell model (A549), from which four candidates were selected and examined. 
Promising results from time-resolved studies concluded that a FRET-based assay can be 
considered as a new way to prove the presence of exosomes and with an ability to differentiate 
between cancer and normal cells. We foresee the use of exosomes in non-invasive breath-based 
clinical applications for medical diagnostics for lung cancer as an important future 


application. 
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CHAPTER 1. INTRODUCTION 


1.1 EXOSOMES 


1.1.1 General characteristics of exosomes 


Over 54 years ago, when Wolf first discovered exosomes while observing platelets in 


»l 


human plasma, he referred them as “platelet dust". Exosomes were simply considered as 
“trash сотрагітепіѕ”2. The term “exosomes” was proposed later іп 1987 by Johnstone et al? 
to clarify that they are a specific subtype of vesicles. They are small single-membraned vesicles 
formed by an endosomal route (i.e., through multivesicular bodies) and sized between 40 and 
100 nm, although some sources consider them as ranging in size from 30 to 150 nm. Exosomes 
are enriched with proteins, nucleic acids, and lipids and originated from inward budding of the 


cellular membrane? (Fig. 1). 
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Figure 1. Overall composition of extracellular vesicles. Adapted from Colombo et al ^. 


Exosomes are generated by all type of cells and can be found in all biological fluids 


including tissue samples. In recent decades, exosomes have been isolated and characterized 


1 


from various types of biological fluids, such as blood and plasma®’, serum’, urine®"!! , breast 


20, and cerebral spinal fluid?!. Several studies show that some 


milk'?-'^, saliva!^-?, tears 
exosomes remain associated with the original cell, whilst others are more widely distributed 


continuing sending signals to other surrounding cells???*. More recently, their role has proven 
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to be more important than previously thought. For example, it is now believed that exosomes 
enable intercellular communication???* by trafficking contents (nucleic acids, proteins and 
lipids) that provide information about the cell of origin and play vital roles in many cell 
mechanisms, including altering immune cell responses?’, enhancing angiogenesis’, and 
inducing apoptosis????, Cellular components of exosomes can be “reassembled” into new 
vesicles and passed to other cells keeping intercellular communication active?! In addition, 
proteins found within exosomes have been shown to reflect the cell of origin???. 

Besides exosomes, other subtypes of mammalian-derived extracellular vesicles (EVs) 
are known. They are larger in size than exosomes and known as microvesicles (or ectosomes) 
and apoptotic bodies223132 The difference between exosomes and other EVs is in their 
biogenesis and will be explained below in 1.1.2. Briefly, microvesicles can reach up to 1 um 
in diameter and are formed by outward budding, which is the opposite process of exosome 
formation. The composition of microvesicles will depend on the composition of the parental 
cell, the microenvironment, and on the triggers that resulted in their release?????^, Apoptotic 
bodies can range in size anywhere from 50 nm to 5000 nm. Unlike exosomes and 
microvesicles, they are highly enriched with proteins associated with the Golgi apparatus, 
nucleus, and mitochondria due to their release by dying се1152536. 

Although the protein content of EVs depends on their route of formation *°%’, there are 
no specific single markers to distinguish exosomes from other subtypes of EVs. Instead, a 
combination of markers can be used to characterize each type of EV??. For example, the 
transmembrane protein CD81 is commonly found in exosomes, while another transmembrane 


3839 and microvesicles. A 


protein, CD63, can be found in many EVs including exosomes 
combination of CD81 and CD63 is generally considered an exosome-specific combination. 
In a similar manner, several markers should be detected to verify the subtype of many types of 


EVs 4041, 


1.1.2 Biogenesis and secretion of exosomes 


The discovery of the molecular content of EVs opened a new page, allowing researchers 
to learn more about the biogenesis of exosomes. Exosomes are formed within endosomal 
membranes after going through several steps: formation and maturation of early endosomes; 
the inward budding of the early endosome membrane results in formation of intraluminal 


vesicles and so forming multivesicular bodies; transport of multivesicular bodies to the plasma 
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membrane; fusion of multivesicular bodies with plasma membrane; and release of the vesicular 


content as exosomes”°*? (Fig. 2). 
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Figure 2. Formation and secretion of exosomes. Adapted from Henrix and Hume ®. 


The formation and maturation of early endosomes begin with the endocytic pathway. 
The endosomal network is a very dynamic system where each compartment represents a unique 
identity. Each compartment can be easily altered by communication between cargo that gets 
transferred between membrane-bound compartments. This is the place where the initial cargo 
sorting “decision” occurs, and the fates of future exosomes are decided. There are several 
mechanisms associated with membrane trafficking. The two most important mechanisms that 
not only actively participate in trafficking but also control the membrane identity and function 
are an alteration in phosphatidylinositol phospholipids and activation of Rab family GTPases. 
Phosphatidylinositol phospholipids are responsible for directional flow of membrane cargoes 
and have a close relationship with RabGTPases?. Although the process of exosome biogenesis 
is not fully understood, studies have shown that several molecules of the Rab family 
(Rab GTPases) are directly involved in the process by allowing the formation of early 
endosomes: Rab5 in transporting early endosomes, Rab4 and Rab! 1 in sorting and recycling 
endosomes, and Rab7 in lysosomes and targeted degradation of endosome cargo proteins. 

The maturation process of early endosomes and their conversion to multivesicular bodies 
is also known as the Rab cascade ^^^*, Briefly, Rab5 recruits guanine-nucleotide exchange 
factors that then recruit Rab7, which follow recruitment of GTPase-activated proteins. During 
this process, the membrane's identity changes from a Rab5 to a Rab7 domain, which allows 


formation of more complex domains such as intraluminal vesicles and later multivesicular 
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bodies. Rab5 and Rab7 continue to actively participate іп the endosome maturation process 
allowing further membrane fusion and creating a platform for protein localization that can be 
recognized by a zinc finger phosphoinositide-binding domain named after the four proteins — 
Fablp, YOTB, Vaclp, and EEA1474. Further, ubiquitination, which depends on the 
RabGTPases mentioned above”, also plays an important role. Ubiquitin is a small protein that 
triggers degradation when fusion of multivesicular bodies and the lysosome occurs sending a 
signal to release the content from plasma membrane via intraluminal. However, not all proteins 
that get sorted to intraluminal vesicles are destined for degradation. Some proteins, like 
tetraspanins, tend to accumulate on the plasma membrane creating tetraspanin-enriched 
microdomains that assist with selection of exosomal cargo (proteins and miRNA) and 
participate in binding and uptake exosomes by cells. Those intraluminal vesicles that are not 
targeted to the lysosome are progenitors for exosomes, and the tetraspanins they bear will be 
defined cargo of exosomes??. The process of initial clustering of ubiquitinated cargo is a 
beginning step in a process of exosomal formation in which endosomal sorting complex 
required for transport (ESCRT) machinery directly participates. This complex involved in this 
step is widely described in the literature as ESCRT-0. Some of the clustered cargo will be 
incorporated into intraluminal vesicles and will become an important part of future 
exosomes??, 

The process of exchange of compartments within an endosomal network is known as 


endosomal maturation and this is the first step of the ESCRT machinery (Fig. 3). 
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Figure 3. Model of ESCRT Machinery. The diagram shows interactions between different subunits of machinery: ESCRT-0 
components in purple, ESCRT-I in green, ESCRT-II in red, and ESCRT-III associated in blue. Relevant domains are identified. 
Jbiquitinated (Ub) cargo, phosphatidylinositol 3-phosphate (РІЗР), lyso(bis)phosphatidic acid (LBPA), and clathrin 
associated with ESCRT-0 included as a model targeting factor. Lipids shown as yellow flags indicate РІЗР binding cites. 
Deubiquitinating enzymes such as AMSH and UBPY, ubiquitin ligases, and the kinases Aurora B and ULK3 are known to 
interact with several ESCRT subunits. Abbreviations: CB, clathrin-box motif; КҮРЕ, Fablp/YOTB/VacIp/EEAl1 domain; 
GLUE, GRAM-like ubiquitin in EAP45 domain; MIT, microtubule interacting and transport domain; Ub, ubiquitin; UEV, 
ubiquitin-conjugated enzyme E2 variant; UIM, ubiquitin-interacting motif. Adapted from Christ et al *!. 
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This process reflects that proteins including phosphatidylinositol kinases like PIKfyve 
get recruited at this stage, causing a shift from phosphatidylinositol-3-phosphate to 
phosphatidylinositol-3,5-bisphosphate. This conversion of the phosphatidylinositol is an 
important maturation step and is linked to recruitment of parts of the ESCRT machinery. 

The inward budding of the early endosome membrane results in formation of 
intraluminal vesicles, thereby creating multivesicular bodies, and marks the next step of 
endosome maturation assisted by the ESCRT тасһіпегу26. The ESCRT-I and ESCRT-II 
complexes play a critical role in formation of future exosomes as they are responsible for 
clustering of ubiquitinated proteins into intraluminal vesicles, initiated by ESCRT-0 (the step 
described above)*?. The role of ESCRT-I is to communicate with ESCRT-0 and ESCRT-II. 
The ESCRT-I complex consists of several proteins, Tsg 101, Vps28, Vps37, and hMvb12 (Fig. 
3)?. Attention has been given to tumor susceptibility gene 101 (Tsg101)?. A recent study 
by Villarroya-Beltri et a* has shown that Tsg101 is directly involved in the lysosomal 
degradation process of multivesicular bodies proteins. A posttranslational ubiquitin-like 
modification that is known as ISGylation can regulate multivesicular bodies?^. Once Tsg 101 
is ISGylated, then secretion of exosomes is suppressed and fusion of multivesicular bodies with 
lysosome is promoted. ESCRT-II has a Y-shaped heterotetrametric structure and contains 
several Vps units that interact with arriving cargo and other proteins. The main role of the 
ESCRT-II complex is to initiate the assembly of ЕЗСЕТ-Ш, which will drive vesicle budding 
to generate intraluminal vesicles. Mature multivesicular bodies can then either fuse with the 
plasma membrane and result in exosome secretion or fuse with lysosomes or 
autophagosomes, thereby degrading their сагоо22:55, Protein degradation is instigated following 
fusion of multivesicular bodies with autophagic vacuoles by a calcium-dependent mechanism. 
The single multivesicular body then has a choice to go down two different pathways. If a 
vesicle chooses to go down by lysosome degradation pathway, it cannot go on to become an 
exosome??55, In the cancer microenvironment, the balance between fusion with the plasma 
membrane and fusion with lysosomes is shifted, resulting in the increased release of exosomes 
compared to the healthy cellular environment? 

Intraluminal vesicles formation is a lipid-driven process that requires the presence of 
specific lipids such as lysobisphosphatidic acid and ceramides obtained from late endosomes 
and their membrane constituents. Those lipids create a lipid platform to which ESCRT-0 binds 
and sorts ubiquitinated membrane proteins. The intraluminal vesicles formation goes through 
two important steps: enrichment with tetraspanins and recruiting ESCRT complexes to the site 


where formation of intraluminal vesicles occurs. Previous studies suggested that ESCRT-I and 
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II begin the process of intraluminal membrane budding that ESCRT-II had activated. Briefly, 
the presence of specific liquids assists in recruiting ESCRT-I and II complexes. Later in the 
process, ESCRT-III is recruited by the ESCRT 0-П complexes through one of two proteins- 
programmed cell death 6 interacting protein (PCD6IP), which is also known as Alix, and 
Tsg101 that is associated with PCD6IP. Alix, being directly involved in budding and scission 
processes, in fact plays the role of a communication bridge, connecting ESCRT-I and ESCRT- 
Ш and keeping interactions with all ESCRT units through a highly charged assembly 
of multimers?". Deubiquitinating enzymes (Fig. 3) actively contribute to the recycling of 
ubiquitin before degradation occurs and controlling proteins stability. AMSH3 (Associated 
Molecule with the SH3 domain of signal transduction adaptor molecule) is one such DUB. 
Through the AMSH3-ESCRT-III interaction, cargo is deubiquitinated^*. Once the ubiquitin tag 
gets removed from the cargo protein, the cargo is released into the intraluminal vesicles??. 

Cleavage of the budded membrane by fission and the process of splitting ILVs are the 
last steps involving the ESCRT machinery>*3. From previous stages, after membrane 
invagination (ESCRT-II) and maturing into a vesicle (ESCRT-III assembly), the future 
exosome is still attached to the membrane by a neck region (Fig. 2). Sequestration of cargo and 
membrane scission promote narrowing down the neck of the vesicle. Sorting of cargo is 
finalized during this step. It becomes packed into a vesicle by recruiting deubiquitination 
machinery with assistance of the ESCRT-III assembly (Fig. 3). Through these processes, the 
ESCRT machinery operates by following a “sort-pack-transport” principle. 

Secretion of exosomes occurs when vesicles with packaged cargo are released to the 
extracellular space. This is the last step in exosomal biogenesis. Once the formation of 
multivesicular bodies is completed, they can either fuse with lysosome followed by degradation 
of their cargo or fuse with the plasma membrane followed by secretion of exosomes. How the 
fate of multivesicular bodies is determined is not well-known. During the transport of 
multivesicular bodies to the plasma membrane, other molecules and interactions are actively 
involved, for example the interaction between the “molecular switches” (small GTPases) from 
the Rab family and other molecules. As has been previously mentioned, Rab family proteins 
regulate vesicular trafficking. Further, Rab27 is required for peripheral docking of 
multivesicular bodies to the plasma membrane. Specifically, Rab27a, along with its effectors 
synaptotagmin-like 4 (Slp4) and exophilin 5 (Slac2b)?5! appears to be involved in future 
exosome secretion with general tetraspanin EV markers (Alix, Tsg101, and CD63). After 
multivesicular bodies docking with the plasma membrane, the final release of exosomes is 


facilitated by the soluble N-ethylmaleimide-sensitive factor attachment protein receptors 


(SNAREs). SNAREs are a small tail-anchored protein family that mediate the fusion of 
multivesicular bodies with the target membrane and so finally lead to vesicle release. 

Alongside ESCRT dependent processes, the role of other pathways, such as ceramide 
biosynthesis pathways, should be mentioned. Complex lipids such as ceramide can be 
associated with the biogenesis of exosomes contributing to inward budding to form 
intraluminal vesicles®. Several studies have shown that an enzyme that produces ceramide 
from sphingomyelin acts as inhibitor that can reduce the number of exosomes released by 
cell$?5^, However, this is not always the case as in some cells the exosomal release was not 
affected by the synthesis of ceramide9?65, In a review on exosomes biogenesis and release, it 
has been mentioned that there is a possibility that both the ESCRT-dependent mechanism and 
ceramide biosynthesis pathways can contribute to exosome biogenesis resulting in formation 
of different subpopulations of exosomes”®. 

The biogenesis pathways of other EVs are different from that of exosomes. For example, 
microvesicles are a subtype of EVs ofa similar size to exosomes (100 nm). Due to their similar 
size, exosomes and microvesicles are described in the literature as a “mixed population"*", 
They were originally referred to as microparticles and were discovered in human plasma 
derived from platelets іп 1967!, Microvesicles can be distinguished from exosomes as they bud 
directly from the plasma membrane followed by pinching off from the membrane. Some of the 
steps are in common with those of exosome formation”, as proteins associated with Golgi 
complex and endoplasmic reticulum can be trafficked to multiple subtypes of EVs*. 
Microvesicles and exosomes can be distinguished as they have particular proteins that 


correspond to different EVs subtypes’. 


1.1.3 Exosomal cargo 


As previously mentioned in 1.1.1, exosomes are loaded with different proteins, RNAs 
(non-coding, messenger RNA, and micro-RNA), DNA, lipids, and other cellular 
components. These exosomal components are conveniently cataloged іп the 
ExoCarta database®’. Exosomal cargo may vary and depends on the physiological and 
pathological conditions of the organism. Cargo is packed during exosome biogenesis as has 
been previously described in 1.1.2. It is known that the exosomal cargo includes molecules that 
convey important information about the cell of origin?” (Fig. 1), including whether the cell 


of origin is diseased or not???, For example, viral infections, neurodegeneration (ргіопѕ”!, 
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Alzheimer’s”, Huntington’s’!), and cancer all lead to different cargo profiles compared to the 


progenitor healthy cells and tissues/??*, 


Protein profiles of exosomes have been extensively studied and reported from lymphoid 


76,77 


tumors”, sarcoma" 67, ovarian cancer’’, breast cancer 79 including various sources of biological 


82,83 84,85 


fluids, such as umbilical cord®°, placenta®!, serum/plasma??9, saliva*+*>, cerebrospinal 


fluid$9*7, and sweat?*. According to a recent review paper overviewing the characterization of 


exosomes by Lin et al®? 


, the past five years have shown a continuous growth in interest in the 
exosomes field, as supported by over a thousand published studies. Various research groups 
for the past few years have recognized the high potential of exosomes to provide novel non- 
invasive biomarkers for clinical diagnosis based on their unique biogenesis, great stability and 
diversity?-1?!, Most importantly, several studies recognized exosomes as minimally-invasive 
cancer biomarker?? and specifically exosome proteins! ?, Thus, exosomes can offer prognostic 
information based on their cargo and can therefore be considered as biomarkers and 


nanocarriers!™ 


. However, their limitation in clinical settings is explained by the difficulty of 
isolation, purification and general analysis due to the lack of standardized methods?6. 
The protein cargo of an exosome arises from the processes of exosome biogenesis 


explained above in 1.1.2 and as suggested in recent studies?>>’ 


. The physical interaction 
between membrane proteins and proteins that have been considered as general exosome 
markers, such as the tetraspanins, has been widely studied inthe exosome research 
community2224.104.105. 

Tetraspanins are a family of proteins with four transmembrane domains that can be found 
in all cell types!96, Those transmembrane domains have a unique signature that allows them to 
be differentiated from other membrane proteins. This signature is a large accumulation of 
cysteines that are present in the large extracellular loop that connects domains 3 and 4 and 


extends 3-5 nm from the plasma membrane!” 


. Tetraspanins actively interact with intracellular 
and other membrane proteins through a variety of specific molecular interactions! and 
participate in the major steps of vesicle formation and transportation!05, 

It has been confirmed that tetraspanins, specifically CD9, CD63, CD81, CD151, and 
Tspan8, are highly enriched іп exosomes. These tetraspanins are therefore frequently used as 
general markers to define exosomes from other subtypes of EVs!??. The pathway through 
which tetraspanins become recruited to exosomes has not yet been fully understood. However, 
it has been proposed that the recruitment of tetraspanins to exosomes is directly associated with 
the origin of exosomes. An initial attempt to explain the interaction of tetraspanins with 


107,110 


exosomes was made through sorting motifs , which was later replaced by interaction with 
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adaptor protein complexes, as some tetraspanins (CD9 and Tspan 8) do not rely on the sorting 
motif to integrate into exosomes. 

In addition, it is believed that small transmembrane proteins such as tetraspanins can 
function as building blocks of larger protein assemblies and participate in trafficking of 
transmembrane proteins! 9, The impact of tetraspanins оп the composition of exosomes is huge 
as they play a direct role on selective enrichment of specific protein-biomarkers that can be 
used for non-invasive diagnosis. Tetraspanins form a network with exosomal proteins and other 
proteins in the cellular microenvironment regulating signaling pathways. 

Tetraspanins are actively engaged in many biological functions of the membrane. 


According to Bari et al!!! 


, one of the important functions of tetraspanins is to regulate 
morphogenesis of membrane protrusions by altering the membrane curvature, possibly 
associated with rearranging the actin cytoskeleton. 

As has been mentioned previously, exosomes are not the only type of vesicles present in 
the extracellular space. It has been proposed that no subtype of EVs can be identified by a 
single protein marker. A combination of protein markers and biophysical techniques is 
required for characterisation of any specific EV subtype. 

The difference in proteomic profiling between exosomes, EVs, and other types of 
vesicles, such as apoptotic blebs therefore supports their route of formation as some vesicles 


(e.g., microvesicles) form outside of the membrane?9?^!?, Their cargo will be differentiated 


by their biogenesis, functions and other criteria?9!?. Besides proteins, exosome cargo also 


114-117 103,118,119 


consist of lipids and nucleic acids 
The RNA cargo in exosomes was confirmed by two independent studies that 
demonstrated transport of functional miRNA and changes in cellular behavior due to exosomal 


uptake in vitro experiments 120.12! 


. In a similar experiment, but in vivo, EVs exchange between 
tumor cells was visualized in living mice by using the Cre-LoxP system!?. Interestingly, 
subtypes of RNAs such as small non-coding RNAs including micro RNAs, can also be 
transported as a part of an EVs’ cargo. Particularly in cancer development$^!2, attention has 
fallen on the role of exosomal micro RNA124-126. It has been reported that tumor-derived 
exosomes can modify the tumor microenvironment (TME) by transferring non-coding RNAs 
and micro RNA (particularly micro RNA-21 and -29) to target cells!?’. 

The transfer of micro RNA was initially demonstrated by Pegtel et а1128 who confirmed 


delivery of viral micro RNAs via exosomes from Epstein-Barr virus infected cells into 


neighboring healthy cells that, as result, showed repression of virus-targeted genes. This 


22 


illustrates (һе general case that exosomes from а donor сей сап alter the intracellular 
environment of a recipient cel1!29.130. 

Exosome-mediated micro RNA transfer was also examined for parasite-derived 
еховошев їп vivo. In опе study, the parasitic genome was used to determine whether any of 
the micro RNA sequences that they found in the serum of infected mice came from the 
parasite!3!, Briefly, to determine whether parasite-derived exosomes can transfer micro RNA 
to mammalian cells, mouse small intestinal epithelial cells were incubated with 
Heligmosomoides polygyrus exosomes, and as a result changes in gene expression were noted. 
Specifically, two genes Dusp/ and 7133r were strongly downregulated. Interestingly, exosomes 
derived from mouse cells did not alter those genes but showed the same uptake level as 
parasite-derived exosomes. That confirmed that exosomes and associated micro RNA can 
regulate genes that are responsible for immunity by suppression of pathogenic responses and 
acting as immunomodulators. 

Although the transfer of exosomes during cell-cell interactions has been described, it is 
still difficult to define the mechanisms regulating transfer of a specific micro RNA to a 
recipient cell, including whether the transfer occurred specifically via an ехоѕоте!?2. As 
described in 1.1.2, packaging of RNA into EVs occurs during their biogenesis. Several studies 
have reported that the molecular profile of micro RNAs in EVs does not necessarily repeat the 
profile of the parental cell!33.134. However, each micro RNA profile still represents the nature 
of the cell from which it was derived, leading to a distinct set of micro RNA from different 


cell sources? 


. In addition, tetraspanins, which have been mentioned previously, were 
addressed as “scaffolds of membrane microdomains” that directly participate in loading future 
exosomes with messenger RNA and micro RNA'?*, 

Genomic DNA is also loaded into еховотев 29137 but this is less studied than RNA. 
There are not that many descriptions of how DNA gets incorporated in EVs. Interestingly, 
genomic DNA has been detected more commonly in cancer cell-derived exosomes than those 
from healthy cells'**. This raises the possibility that the DNA found in exosomes may provide 
opportunities for highly sensitive cancer detection, and reveal information about tumor-specific 
mutations that could be valuable in therapy '??. Although both single and double stranded DNA 
are found within exosomes !^', several studies propose that the majority of DNA that is 
associated with tumor-derived exosomes is double-stranded'??.141.142, 

There is a long-established interest in the lipid profiles of exosomes!?. Lipids, as well as 
proteins, play specific roles in formation and functions of exosome biogenesis'**. In general, 


exosomes are enriched іп sphingomyelin, phosphatidylserine, cholesterol, and 
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lysophosphatidylcholine accounting for about 10% of all phospholipids and 
glycosphingolipids!45:146 Lipids in exosomes are related to the lipids in their parental cells; 
however, their sorting ratio does not show equality. In one example, exosomes from two 
different cell types (mast and dendritic cells) were observed and compared. The ratio of lipids 
such as phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, 
phosphatidylinositol, and sphingomyelin were compared. According to the results, the 
composition of lipids in the exosomes from dendritic cells was 26:26:19:20 respectively while 
the combination in dendritic cells itself was 43:23:12:9. The same study showed that 
cholesterol at similar levels in the exosomes as in their parent dendritic cells. Vesicles isolated 
from the placenta showed elevated levels of cholesterol and sphingomyelin, while the level of 


phosphatidylglycerol was half that in vesicles derived from dendritic се115147-148, 


1.2 THE VALUE OF EXOSOMES 


1.2.1 The importance of exosomes 


What makes exosomes so important in the human body is their ability to facilitate 
intercommunication among cells causing specific cellular responses?! such as transferring, 
signaling, and activating receptors of cells 2249, The binding of exosomes to their target cells 
is probably determined by specific interactions of proteins enriched in the membrane of EVs, 
including tumor-derived exosomes, and proteins on the target cell!50. In addition, they can 


help to adapt to microenvironmental stresses!>!!>? 


or alter the cell itself causing pathological 
changes!53:154. According to Masaoutis et al!25 the growing interest іп exosomes is explained 
by their ability to modulate gene expression post-transcriptionally'**. For example, released 
EVs interfere with the immune system'5, forcing various immune cells, such as macrophages, 
natural killer (NK) cells, along with B and T cells, to suppress or promote tumor progression. 
Lung tumor-derived exosomes containing micro RNA-21 and -29 can activate TLR 8 in 


156-158 and can activate 


immune cells, which can lead to tumor growth followed by metastasis 
TLR 8 in immune cells!% Several studies have shown that exosomes are responsible for 
delivering immunogenic molecules causing the initiation, growth and progression of 


17,104,105 malignancy and drug resistance of tumors!33:159-161_ As described above, EVs 


cancer 
contribute to various pathological conditions. 

Another area of interest in exosome-related research is the search for biomarkers that can 

be interrogated by diagnostics that are less invasive and more rapid than current processes!23. 


Thus, from the early description of being cellular waste as described in 1.1.1, exosomes have 
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4,33,162 


become recognized as important in the chain of cellular intercommunication including 


pathogenic processes!53:154_ 


1.2.2 Review of current leading biomarkers for cancer found in exosomes 


Exosomes carry multiple cargos, which report information of the health status of the cells 


from which they were released, which is communicated to neighbouring сеП622163-165 | 


Moreover, cancer cells tend to release larger quantities of EVs compared to healthy cells! 
168 which can promote communication between tumor, tumor microenvironment, and 
metastatic sites! Cells constantly communicate with each other by releasing vesicles in the 
environment in which they grow. The crosstalk between healthy and tumor cells happens 
through the same vesicle release mechanism. According to Alipoor et aP, these multi-signal 
messengers (i.e., EVs) have an ability to educate the tumor's microenvironment and so favor 
optimal conditions for the tumor, making effective therapeutic strategies difficult. Therefore, 
unique proteins that arise from the tumor microenvironment offer a potential avenue for 
diagnosis. 

The clinical significance of exosome cargo as potential biomarkers in cancer has been 
discussed іп 1233 manuscripts up to September 20189, The importance of detection of tumors 
at an early stage has been recognized in many studies. Early detection is associated with a 
greatly improved prognosis. This underlines the value of exploring new biomarkers that can be 
detected at the earliest stages of cancer onset. Exosomes have great potential to contribute to 
our understanding of tumors development based on their molecular cargo. The variety of 
potential cancer protein biomarkers determined through proteomic approaches found on 


exosomes isolated from different biological fluids is shown in the Table 1. 
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Table 1. Potential cancer biomarkers found in exosomes secreted in different body fluids! ??. 


Cancer type Potential Cancer Biomarkers Source 

Bladder Cancer TACSTD2, EDIL-3, Mucin4, EPS8L2, a6-integrin, MUC-1, Urine 
Basigin 

Breast cancer Survivin, Survivin-2B, CEA, Tumor antigen15-3 Blood 

Colorectal cancer CEA Blood 

Lung cancer EpCAM, EGFR, CEA, LRG-1 Bloo 

Lung cancer NY-ESO-1 Serum 

Lung cancer LRGI Urine 

Lung cancer CD171, EGFR Plasma 

Melanoma CD63, Caveolinl, TYRP2, VLA-4, HSP70 Bloo 

Nasopharyngeal І.МРІ, Galectin-9, BARF-1 Blood, Saliva 

cancer 

Ovarian cancer Claudin-4, LICAM, CD24, ADAM10, EMMPRIN, TGFB1, Bloo 
MAGE3/6 

Pancreatic cancer GPC1, MIF Bloo 

Prostate cancer Transmembrane, protease, Serine2-ETS, f-catenin, PCA3, PSA, Urine 
PSMA, ITGA3, ITGBI, survivin, PTEN 

Renal cell MMP-9, EMMPRIN, Carbonic anhydrase Urine 

carcinoma 

Spongioblastoma EGFR-VIII, EGFR, PDPN, IDHI Blood 

Stomach cancer HER-2/neu, CCR6 Blood 


Meanwhile, several protein biomarkers that are present in lung cancer compared with 
normal lung tissue have been identified. For example, NY-ESO-1 protein has been found in 
serum exosomes 77, LRG1 protein in urinary exosomes!?, and CD171'^ and EGFR!” in 
plasma exosomes. There is no doubt that the protein profile of exosomes provides potential 
biomarkers with high sensitivity and selectivity for cancer. However, the small number of 
proteins isolated, often found in very low abundance, creates a substantial challenge for 
biomarker identification" 5. Not all exosome proteins are membrane-associated proteins. 
However, some of them are directly associated with NSCLC growth and cancer invasion, 
including upregulation of EGFR рговсіп 77-179, For example, the prognostic value of EGFR 
protein expression alone, which plays a critical role in the cell signaling pathways, is not as 
high. Although the role of EGFR protein and other tumor associated proteins (claudins, KRAS, 
and other RAB-family proteins)? in cell cancer іп well studied, there is not much evidence of 


ЕСЕК protein transferred by exosomes!7? 


. Despite uncertainties of exosomes’ EGFR protein, 
Huang et al!š0 concluded that EGFR protein is present іп “80 % of exosomes purified from 
lung cancer biopsies" while Zhang et al!*! showed co-dependence between exosome-regulated 
EGFR protein and metastasis in liver cancer. Proteomic analysis of exosomes will define the 


protein cargo associated with exosomes and allow evaluation of potential biomarkers of the 
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lung cancer tissue origin. On the other hand, it is a useful marker when combined with other 
specific proteins!82.183 

In addition to exosome protein biomarkers, nucleic acid biomarkers offer an avenue for 
earlier disease detection. Indeed, nucleic acids can be used in-ultrasensitive molecular 
diagnostics, but this is not without its limitations. As with protein biomarkers, the same micro 
RNA can be present in different types of tumors which results in poor outcome in diagnosis of 
a particular type of cancer. For example, an extensive investigation reported the presence of 
micro RNA-21 in several types of cancer (breast, liver, endometrial, esophageal) including 
lung cancer!?^, In particular, micro RNA exosomal biomarkers directly related to lung cancer 
have been highlighted in several studies!70-185.186 detecting 12 specific overexpressed miRNA”. 
Exosome-associated RNA has been proposed to have value as a diagnostic biomarker, 
including micro RNA-193a for С.С! and micro RNA-222-3p as for NSCLC!š, 

The information that exosome can provide may have a tremendous impact in future 
biomedical applications and particularly diagnostic techniques for cancer. To avoid the risk of 


developing diagnostics for non-selective biomarkers, a database with a set of defined 


biomarkers characterizing exosome content must be developed. 


1.2.3 Existing exosome-based diagnostics 


The increases in understanding of the origins, roles, and composition of exosomes 
described above have laid the framework for their use in diagnostics. Although exosomes and 
their molecular cargo have not been fully characterized, there are several approaches that 
exploit the use of exosomes in early cancer detection. Application of exosomes in early tumor 
diagnosis has been mentioned in several clinical trials for tumors. For example, micro RNA 
profile of exosome has been used as a complimentary tool in the diagnosis of breast сапсег! 9, 
cholangiocarcinoma and gallbladder сапсег!%, and prostate сапсег!?!. In lung cancer a 
diagnostic approach that combines exosomal RNA/DNA and cfDNA provides a sensitivity of 
89%!92, This is especially important as detection of the early stages of tumors has been an 
ongoing challenge in oncology in various types of cancer, whilst treatments applied at these 
early stages of tumor development prove the highest chances of therapeutic success. The 
applications of exosomes to cancer diagnosis have been previously mentioned in 1.2.1 -1.2.2. 
These advantages can be extended by the concept of liquid biopsies. Application of exosomes 


as liquid biopsy in clinical diagnosis of сапсег!?? holds a promise to detect potential targets 
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with minimum invasive treatment. Liquid biopsy points to the sampling and analysis of the 
biological fluids that contain nucleic acids, circulating tumor cells and exosomes. That makes 
them excellent candidates for development of new diagnostic tests. There are several studies 
that report the potential of exosomes in early tumor diagnosis. These include use of a KRAS 
mutation in circulating exosome DNA (exoDNA) for diagnosis of colorectal cancer (CRC) 
and pancreatic ductal carcinoma and ЕСЕК T790M protein with a combination of 
exoRNA/DNA and cfDNA іп NSCLC!?. Another interesting study was conducted to 
investigate the clinical value of plasma exosomes in NSCLC. This study analyzed patients 
samples from early and advanced stages of NSCLC including healthy population cohort 
investigating the correlation between Tim-3 and Galectin-9 exosome indicators!*. 

Deep learning-based surface enhanced Raman spectroscopy (SERS) has been proposed 
as a tool for cancer diagnostics. This technique analyses specific signals produced by cancer- 
derived exosomes 9^. Although results of this study resulted in 95% accuracy of diagnosis for 


195,196 


cancer patients at stage I and II, this technique is still considered relatively new and needs 


further investigation considering the heterogeneity of exosomes!%. 


1.3 LUNG CANCER 


1.3.1 Lung cancer 


Cancer is ranked in the top four causes of death in 135 countries, with an estimated 19.3 
million new cases worldwide in 2020 and approximately 10 million cancer-associated 
deaths'??, According to the World Health organization, cancer is the second most common 
leading cause of death before the age of 70 years worldwide. Globally, about 1 in 6 deaths is 
due to cancer. Although lung cancer is only the second most common cancer in both men and 
women, it is the leading cause of cancer death with an estimated 1.8 million deaths due to lung 
cancer in 2020'?, There are two major types of lung cancer: non-small cell lung cancer 
(NSCLC) and small cell lung cancer (SCLC). NSCLC is more common and a leading cause of 
cancer-related mortality in worldwide with an overall 5-year survival rate of 19.3%. The 5 - 


year survival rate for SCLC is 109420? 


. NSCLC has four stages, along with two substages for 
stages I-III. Statistics show that the frequency of diagnosis of early stages, when curative 
treatment is possible, is very low compared to the more advanced stages (Table 2). This is 
because early stages are asymptomatic and current diagnostic methods, such as biopsy and 
radiography, are unable to detect the smallest tumors. Therefore, the main way lung cancer 


is diagnosed is when it causes symptoms such as a cough or pain. By that time lung cancer has 
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already progressed to advanced stages. Once advanced stage has been reached, the rate of 
survival drastically drops. It is important to differentiate between cancer staging. While clinical 
staging is generally based on the results received after tests, for example, radiography or 
biopsy, pathological staging is based on the results after surgical procedure and therefore 


provides a better assessment on tumor development20!. 


Table 2. 5-year survival rate for non-small cell lung cancer (NSCLC) and small cell lung cancer (SCLC) by pathological 
stage. 


Stage IA IB HA HB HIA HIB IV 
NSCLC 2% 50% 43% 36% 25% 19% 7% 2% 
SCLC 702205 38% 21% 38% 18% 13% 9% 1% 


Another study reported that the 5-уеаг survival rate varies between clinical апа pathological 
stages, in which case the percentage for stage IV NSCLC survivors, for example, can be from 
2% by clinical stage to 13% by pathological stage??, NSCLC includes several subtypes of 
primary cancer, of which adenocarcinoma is the most common invasive lesion (50%). 
Squamous and large cell carcinoma account for 25% and 10 % respectively of all lung cancers 
and NSCLC tumors?", SCLC account for the remaining 15% of lung cancer cases and is 
considered an aggressive cancer associated with smoking?9. This type of cancer tends to grow 
very quickly and is characterized by an early metastatic spread. SCLC is a potentially curable 
type of cancer with a survival rate up to 25% апа consists of two stages: limited and 


extensive?06207 


1.3.2 Lung cancer and current diagnostics 


As indicated above, lung cancer develops through 4 different stages. At stage I the cancer 
is located only in one small area. Stage II means that cancer has grown but has not spread yet. 
If lung cancer is diagnosed at stage I or II, patients have more chances for survival as it has not 
spread to nearby tissues and organs. Stage III is when cancer largely develops and progresses 
to nearby tissues and lymph nodes. At stage IV, the cancer has spread to other organs of the 
body. Stage III is still an operable one; however, chances for survival decrease to almost half 


of the previous stage (Table 2). Early detection in lung cancer is especially challenging?95209, 


and consequently lung cancer and resultant metastatic disease are frequently fatal?!°. 
Diagnostic imaging is widely used for staging, following up, and assessing lung cancer 


treatment. Chest radiography, computed tomography (CT) scans, magnetic resonance imaging 
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(MRI), and positron emission tomography (PET) scans are the most common?!!, Around 80% 
of patients who present with cough, breathlessness, and chest pain receive chest radiography 
as their first routine procedure?! 27, CT and MRI scans can image the thoracic cavity more 
effectively than chest radiography, identifying specific features and therefore can be effectively 
employed in detection. Combining PET and CT scans can improve diagnostic imaging by 
reducing false-positive interpretation from an individual imaging technique?P. 

Additional tests for lung cancer include examination ofthe lungs lining by direct invasive 
techniques, such as bronchoscopy, resection biopsy, and transthoracic needle aspiration?!42!5 
can provide an accurate diagnosis of the tumor. Other types of tissue diagnosis are sputum 
cytology and pleural fluid cytology (thoracentesis). Both are non-invasive methods and are 


often used in clinical staging but require a follow-up test if results have been negative in 


response to cancer markers. 


An overview of current diagnostics methods reviewing their potential advantages and 


disadvantages are presented in the Table 3. 


Table 3. Advantages and disadvantages of current diagnostics methods. Most common diagnostic methods for lung cancer 
are marked by asterisk * 


Test 


What does it show 


Advantages 


Disadvantages 


Chest X-ray* 


Images of the lungs and 
surrounding tissues 


Commonly available; 75.4% 
sensitivity at 1 year (study was 
based on individuals >50 years 
old?!5; unexpensive?? 


The nodule size detection is limited 
to 1 to 2 cm; low visibility due to 
other spiculated masses from 
previous complications; false- 
positive rate 9-159927; requires 
follow-up imaging2!š 


low-dose spiral 
CT(LDCT)* 


3-D views of the lungs 


Resolution capabilities under less 
than 1 cm; better contrast 
resolution (compared to chest X- 
ray); evaluation of the entire 2!9 


High false-positive rates 20-50%; 
limited detection of cancer on early 
stages over detection of already 
existing tumor in later stages Ш- 
IV218 


Lab tests 
(ctDNA, NGS) 


Mutations that occur in a 
cancer cells’ genome 


87% sensitivity and 81% 
specificity on the stages II-III; 
non-invasive220.221 


Low sensitivity іп stage І (15%); 
limited by cost”??? 


PET/CT* 


Detailed soft tissues 
presented in cross- 
sectional images 


Greater resolution of nodules and 
tumor pathology (compared to 
chest X-ray and CT); more 
accurate than CT in the detection 
or exclusion of mediastinal nodal 


metastases2!š 


High false-positive rate (up to 
50%); at least 20% аге over 
diagnosed; inability to locate the 
tumor due to the blurry images?!; 
cost-effective 


EBUS 


Cell abnormalities 


Direct visualization of nodes; 
high sensitivity (85%); real-time 
sampling (less risk of major 
vessel puncture); guided 
sampling?!5224 


Less core tissue for sampling; low 
negative predictive value; cannot, 
image or sample subaortic and 
paraoesophageal lymph nodes; 
repeated invasive sampling in early 
stages of cancer??? 


Biopsy 


Cell abnormalities 


Rapid, minimally invasive, and 
economical test, liquid biopsy 
presents oncologists with real- 
time information about а 
tumor?!^ 


Lack of standardization around 
procedures; difficulty in comparing 
different technology platforms; 
lower sensitivity (stage I)?!^ 
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Bronchoscopy Cell abnormalities 


Abilities in investigate area out of 
reach (smaller bronchi; Higher 
sensitivity of 85-87% when 
lesions were >20 mm and an 
overall sensitivity of 69.7%; 
accuracy 89%205215 


Bleeding (0.12%), hypoxia, loss of 
airway, pneumothorax (0.1- 
0.16%), or mortality (0-0.02%)226; 
sensitivity depends on the size and 
position; repeated invasive 
sampling in early stages of cancer of 
the tumor (34-88%)? 


MRI* Detailed images of soft 
tissues in the body 


Better soft tissue contrast, 
multiplanar imaging capability 
(over СТ)????%, can quickly 
recognize nodule between 1-2 cm 


Slower and more expensive with 
poorer spatial resolution (compared 
to СТ)2!*; inability to recognize 
nodules less than 0.4 cm229230 


with 8296 specificity?29250 


Sputum Cell abnormalities Combining with quantitative Squamous cell lung cancer only 
cytology image cytometry can lead to a 

specificity of 90% and sensitivity 

of 80942! 


1.3.3 Failure of current diagnostics in detection of lung cancer 


Improved detection leads to better knowledge of lung cancer progression. Despite current 


cancer treatment therapies (e.g. chemotherapy, radiotherapy, surgery)!2?26209 


and good 
diagnostic tools (e.g. sputum cytology, chest radiography (and later on low-dose spiral 
computer tomography (CT), and tissue sample (biopsy))2%? lung cancer still remains the leading 
cause of cancer-related deaths around the world3-205232_ The reason for this is inability to detect 
lung cancer early enough so that the correct treatment can be chosen. Most available 
diagnostics described above lack sensitivity and have other limitations. 

Diagnostic imaging has been described in many comparative studies concluding that the 
tumor evaluation and treatment depends on a type of lung cancer. For example, CT and MRI 
have poorer outcomes in identifying tumors, while PET scans are able to confirm NSCLC in 
83% of the cases?!?. Although PET scans have proven their clinical values in some cases, the 
application of PET itself in identifying primary malignancies remains controversial due to 


false-positive findings and resolution???2* 


. PET scans as a molecular imaging technique that 
observe malignant cells and measure the cellular glucose uptake. The glycolytic rate is 
visualized by using !*F-fluorodeoxyglucose. One of the important limitations of PET scans is 
difficulty and sometimes their inability to precisely localize the tumor due to the blurry images 
of anatomical structures or indirect inflammation». Besides that, PET is a very expensive 
diagnostic method, which is a major drawback that needs further improvement236. 

A comparative study has been conducted on almost 12,000 healthy individuals to find 
out the performance of MRI in the detection of lung cancer and its ability to define 
asymptomatic patients??". Results of the study showed that 4.8% had suspicious nodules, and 


ultimately only 0.496 of the cohort had lung cancer confirmed. MRI was evaluated in another 
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study that discussed relevance of the size for malignant tumors. All nodules less than 4 mm 
and smaller showed inconclusive results, while nodules between 10 and 20 mm were 
recognizable as the malignancy of masses showing 28% to 82% respectively?29, MRI 
imaging can be compromised by respiratory movements by forming masses that can be 
mistakenly assumed as tumor masses238. Nonetheless, MRI has advantages over CT scans 
delivering results in multiplanar format and having the ability to very quickly determine 
existing tumors?2722$, 

Another approach is the Saccomanno technique, which involves collection of sputum 
samples. It has been widely used and showed positive results?!. However, when this method 
was compared to fresh smears from lung cancer patients, the Saccomanno method was not able 
to identify small cell carcinoma while the second method (fresh smears) did??. Sputum 
cytology is still a useful diagnostic test and in some countries is used more often than biopsy?*^- 
242. 

Surgical lung сапсег diagnosis is undertaken іп early stages of cancer to reduce 
morbidity. The most common procedure is a standard resection that is often combined with 
chemo/radiotherapy??. However, several studies have reported that tumor masses often 
confirmed as benign tumors and do not always associate with lung cancer?“+. That raises the 
question about unnecessary surgical procedures that could potentially result in medical 
complications to a patient needing months of long recovery. The paradox of preoperative and 
no preoperative tissue diagnosis was explained іп Sihoe's et al. study?^. In this study, the 
surgery was performed in 443 patients with an early confirmed or suspected lung cancer mass. 
Another 206 (46.5%) patients did not have preoperatively confirmed diagnosis. Among 
patients without preoperative diagnosis only 8 % were found to have a benign tumor. In 
addition, 52.996 of the cases suspected to be tumors on the basis of preoperative diagnosis were 
not confirmed as lung cancer. The study concluded that proceeding with surgery without 
preoperative diagnosis is safe, but only in the case of patients who have a suspicious lung mass 
already identified. Therefore, new methods for earlier lung cancer detection 


аге essentia]?05245246. 


1.3.4 New diagnostic methods under development 


One recently developed method of diagnosis looks at the distinctive appearance of gene 
expression from bronchial epithelial cells that shows differences between the gene expression 


profile in healthy and cancer cells. The study proposed a gene expression classifier consisting 
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оҒа combination of 17 сапсег genes (from ап initially identified group of 232 genes). Тһе gene 
classifier panel was then used alongside the bronchoscopy procedure to increase sensitivity”. 
Another novel concept that increases sensitivity of lung cancer detection is radiomics. 
Radiomics combines data from several diagnostics (CT, PET, MRI) allowing verification of 
diagnosis and choosing treatments more effectively?^*. The idea behind this method is to detect 
the tumor and define the correct tumor phenotype, which further can improve clinica. 


decision making?^. 


Detection of an abnormal tumor antigen can be another way of diagnosing a tumor, even 


in early stages. This method is based on producing a signal by autoantibodies in response of 
detection tumor masses. This method seems to be applicable on all stages of lung cancer based 
on identified panel of autoantibodies. The initial clinical study to test the sensitivity and 
specificity of this method was conducted in 2011 showing high sensitivity up to 9396250, 
However, the sensitivity was later reported being around 37-4096 which is very low and needs 
to be improved for the overall test accuracy??!. 

Characterization of the complement cascade and specifically analyzing the level of 
plasma circulating complement component (C4d) could be an alternative way of detecting lung 
cancer??, However, in patients who appear to have emphysema and COPD this test didn’t 
show significant elevation and therefore cannot be used in people who are at risk of lung cancer. 
Results of this test also showed great specificity (89%), but just as a previous test with AAbs 
it resulted in low sensitivity in a very close values of 44%253. 

Evidence suggested that tumor suppressor genes can be detected through abnormally 
methylated DNA circulating in the serum of lung cancer patients іп all tumor 542668254, Several 
studies have specifically tested this method on patients in stages I and II. Results from 
investigations into differences in DNA methylation of a range of markers between lung cancer 
patients and non-cancer controls were somewhat similar and were compared by reported 
percentage of the sensitivity and specificity in plasma and sputum. The final numbers for 
sensitivity and specificity in plasma were 62% to 76% and 74% to 84% respectively, and 63% 
to 86% and 72% to 92% in sputum, respectively255-257_ 

Real-time endobronchial ultrasounds (EBUS) has been developed to guide the 
established technique of transbronchial needle aspiration (TBNA) that offers a higher 
sensitivity using a larger scanning гапре?!5, The accuracy and performance of EBUS-TBNA 
has been studied by several research groups. A study proposed by Navani et al??* recruited 
patients that had already undergone initial diagnosis by CT scans and were suspected to have 


lung cancer (stages I to IIIA). Following the primary goal of defining the time-to-treatment 
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decision, this randomized trial concluded that detection of tumor was faster by EBUS-TBNA 
(14 days) compared to CT scans and staging diagnosis (29 days). For this reason, they propose 
that EBUS-TBNA should be considered as the initial method of lung cancer investigation. A 
year later, another study concluded that EBUS-TBNA reduces the time for diagnosis if it 
chosen as the initial test?*. Analyzes of cytological samples collected by EBUS-TBNA also 
showed reassuring results, declaring the sensitivity and specificity reached 94.52% and 
95.12%, respectively???. Minor concerns to sensitivity mentioned in this study were associated 
with a lymph node size threshold”. In addition, EBUS-TBNA is less invasive than surgery 
and recommended for patients who need an initial staging?S!. 

Circulating tumor DNA (ctDNA) and next-generation sequencing (NGS) of ctDNA have 
been recognized as sensitive and non-invasive tools for detection of early stages of cancer262- 
264 The role of ctDNA as a biomarker of advanced stages of lung cancer has been already 
proven??, However, the evidence of its application in early stages needs more proof due to a 
low sensitivity that is 15% at stage I compared to 100% at stages II-IIP?222, A newly proposed 
way of defining biomarkers, NGS, is based on genomic alteration in the tumor. Interestingly, 
a study that included over 400 patients, where half of them were patients were without known 
cancer, has been reported with the same output as in the previous mentioned ctDNA studies. 
Further, the data indicated that the proportion of cancer patients with ctDNA was rather large 
with different cancer types than in patients without known сапсег?65, Although the presence of 
ctDNA in early stages of cancer has been reported?®?®, tumor detection has to be achieved 
against a background of non-tumor derived ctDNA. The low level of ctDNA results in a lot of 
unnecessary data for analysis, such as signal to noise ratio, and becomes computationally 
challenging making it incredibly difficult to apply this method for routine screening and 
especially for early stages221267_ As a result, this method seems to be more appropriate for more 


advanced stages. Cohen et а1265 


proposed combining this method with protein biomarkers 
(protein) for a stronger sensitivity in detection. 

Raman spectroscopy combined with principal component analysis (PCA) has been 
further investigated?®* after it attracted interest in medical diagnoses of nasopharyngeal 
cancer?®, head and песК270 and lung cancer?’!. Several studies that have applied Raman 
spectroscopic analysis for in vivo early lung cancer detection reported that sensitivity and 
specificity in detection of precancerous lesions were acceptable and could differentiate 
between malignant, benign, and normal lung tissue?” 


Apart from nucleic acid sequencing and Raman spectroscopy, mass-based proteomic 


analysis, tandem-mass-tag (TMT)!”° and triple stable isotope labeling by/with amino acids in 
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cell culture (SILAC) quantitative proteomics approach аге becoming more applicable іп 
diagnostics??. Those will be described in Chapter 3 (subchapter 3.1.2) 

In summary, there are no current or developing methods that could reliably give an early 
diagnosis. Existing diagnostics are invasive and noneffective. On the other hand, the concept 
of studying breath exosomes can open a new non-invasive and more effective avenue for 
diagnosis. Breath exosomes collected from exhaled breath condensate (EBC) could be a better 
option for biomarkers as they are largely free of contaminants that whole EBC struggles with. 
Despite the promise of new techniques, challenges in diagnosis of early-stage tumors remain 


high. Therefore, new methods of non-invasive diagnostics for early stages are highly desirable. 


1.4 BREATH-DERIVED EXOSOMES 


1.4.1 How breath-derived exosomes are characterized 


In general, exosomes have been divided into two groups: cell-derived exosomes and 
exosomes “derived from body fluids (plasma, urine, spinal fluid, saliva, and etc)” 126, However, 
another important source of exosomes - breath - has been largely ignored. Although the 


potential of exosomes obtained from EBC has been considered! !^274276 


, their application has 
not been extensively explored. Whilst there are several studies that have reported biomarkers 
in EBC, in general these have used either crude ЕВС!72174 ог exosome-enriched 
fractions!14274275 [n the study by Youssef at al^, exosome-enriched fractions from airways 
obtained from bronchoalveolar lavage fluid collected from 43 asthmatic patients and 42 
healthy individuals were analyzed. The study confirmed no significant differences between 
groups in the proportion of extracellular vesicles that contained the general tetraspanin marker 
CD63. However, the expression levels of other markers (1.е., CD54 and HLA-DR) were 
significantly higher in the diseased group. In addition, the analysis of lipid signatures of 
asthmatic patients showed significant differences (р <0.05) in the level of 
phosphoglycerolipids and sphingolipids compared to healthy group''* supporting the 
sphingomyelinase inhibitors that can reduce the number of exosomes release by cell. In the 
Sinha et a? study 20 EBC samples collected from 10 asthmatic and 10 healthy individuals 
were compared with the goal of finding disease signature in micro RNA. As a result, 11 micro 
RNA markers were noticed as differently expressed in the diseased group compared to healthy 
individuals. Amongst these 11, 3 (hsa-miR-574-5p, hsa-miR-516a-5p, and hsa-miR-421)?” 
had been previously reported in association with asthma. Interestingly, they showed that the 


majority of the identified exosomal micro RNA is similar to that identified in EBC suggesting 
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that exosomes obtained from EBC contain microRNAs that can act as biomarkers. However it 
was not a focus of their work to characterize or confirm the purity of the exosomes that they 
obtained through immunoaffinity isolation. 

It is expected that exosomes isolated from EBC will be particularly rich in those 
originating from lung tissue and, through biomarker analysis, reveal the status of lung health. 
For example, these breath-derived exosomes may carry distinctive molecular biomarkers of 
lung cancer, a disease where early detection is particularly challenging but urgently required 


to enhance overall survival for lung cancer patients?05245246278-280 


1.4.2 Biomarkers in exhaled breath condensate 


Exhaled breath condensate (EBC) is a promising source of biomarkers for monitoring 
the lung and respiratory system. Collection of EBC is non-invasive and requires tidal breathing 
into a cooled condenser or a collection vessel. In comparison, collection of bronchoalveolar 
lavage for biomarker analysis is substantially more invasive and the level of biomarkers found 
in EBC сап be much higher than in bronchoalveolar lavage?*!. There has been significant 


growth in EBC research in recent years to identify biomarkers for diseases including chronic 


ТЕ 286—288 


obstructive pulmonary disease (COPD and asthma 

Investigations related to genomic DNA mutations in EBC (specifically in the p53, EGFR, 
p16, and KRAS genes) were reported by several groups289-291_ Since then more investigations 
have been done and detection of small amounts of DNA in EBC samples has been reported??? 
294. Cancer-associated mutations in DNA were extensively researched by Youssef’s group 
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using next-generation sequencing””>. In their first study, the researchers studied mutations of 


22 cancer genes in a healthy population. This study suggests that EBC could be adopted for 
lung cancer detection in its earlystage??. In their second study, following the same 
methodology, EBC collected from patients diagnosed with various types of lung cancer and 
group of healthy population was compared. As anticipated, some differences in the level of 
DNA mutations between groups were noticeable. In order to analyze whether there are 
differences in cancer-associated mutations in DNA before and after treatment (chemotherapy), 
EBC of 8 lung cancer diagnosed patients was collected before and after treatment. 
Unfortunately, after conducting statistical analysis the difference was not statistically 
significant (р-0.83), although the number of mutations in the treated group was lower 


compared to the non-treated patients2%6_ 
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ЕВС as a source of biomarkers has been extensively researched for its clinical usefulness 
especially for early stage lung cancer detection297 304 Examples of previously identified breath- 


based biomarkers for lung cancer can be found in Table 4. 


Table 4. Breath-based biomarkers for lung cancer detection. 


Cytokines DNA, protein, peptides Oxidative stress Others 

Interleukin-2305 S100A11, ANXA1, ENOI, FABP5*°? Hydrogen peroxide??? [ipid 
peroxidation?05209 

Interleukin-6?!? Let-7304 Nitric oxide??? 

Tumor necrosis factor-a КЕТІ, KRT6A, KRT6B, KRT6C, 8-isoprostane?!! 


KRT9, КЕТ10%2 
3p microsatellite alteration: 


57922312 


p53 mutations?? 


Endothelin- 17° 


Leptin? 


A recent study used a proteomic approach in exploring lung cancer biomarkers in EBC 
from 20 individuals (n=10 diseased and n=10 healthy). They reported identification of a total 
1151 proteins from which four (5100А11, ANXAI, ENOI, ЕАВР5) (Table 4) were identified 
as novel biomarkers for lungcancer?, 

Although it seems that EBC could potentially offer biomarkers for several diseases, 
diagnostic accuracy of its use in needs to be improved. For example, detection of COPD based 
on previously identified EBC inflammatory markers remains challenging and health history or 
concurrent diseases need to be considered??. Another study in which EBC biomarkers were 
compared to bronchoalveolar lavage biomarkers showed no significant correlation?! 

Some limitations in defining biomarkers from EBC include concentration of sample, low 
limit of sensitivity (due to being highly diluted), and contaminants, such as saliva and volatile 
organic compounds. On another hand, exosomes derived from EBC could be a better option 
for biomarkers than whole EBC, as exosomes can be purified away from the contaminants that 


can be present in whole EBC. 
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1.5 AIMS AND OBJECTIVES OF THE PROJECT 


Aim of the project: 

The aims of this project were to characterize exosomes collected from exhaled breath 
condensate using physicochemical and proteomic methods, to use cell culture models to 
identify putative biomarkers of lung cancer found on exosomes, and to develop a proof-of- 


principle test for these biomarkers. 


Objectives: 
1. Optimize methodology to isolate exosomes from cell models of normal lung tissue (WI-38 
cells) and lung cancer (A549 cells) to identify the proteins found in these exosomes and to 


propose biomarkers of lung cancer using proteomic methodology. 


2. Isolate exosomes from human breath and identify the proteins found in these exosomes 


using proteomic methodology. 


3. Define the proteome of exosomes from cell models of normal lung tissue (WI-38 cells) and 
lung cancer (A549 cells) to demonstrate that it is possible to identify potential biomarkers 
of lung cancer and evaluate whether those may be revealed on exosomes to develop a proof- 
of-principle test using cell culture model. Secondly, determine the benchmark proteome 
from breath-derived exosomes and use those exosomes as a negative control to test the 


proposed putative biomarkers of lung disease. 
4. Develop a sensitive FRET-based assay to detect interaction between exosomes and 


proposed putative lung cancer biomarkers with a final goal to distinguish biomarkers 


between cancer and non-tumorigenic exosomes. 
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CHAPTER 2. ISOLATION AND QUANTIFICATION OF CELL- 
AND BREATH-DERIVED EXOSOMES 


Aim: 
The aim of this chapter was to describe the isolation of cell- and breath-derived exosomes and 


their characterization using physicochemical methods. 


Objectives: 

1. Optimize methodology to isolate exosomes from cell models of normal lung tissue 
(WI-38 cells) and lung cancer (A549 cells) that are sufficiently high quality from cell- 
based sources. 

2. Apply optimized methodology to isolate exosomes from human breath and undertake 


phenotyping of breath exosomes by comparing them to cell-derived exosomes. 


2.1 INTRODUCTION 


2.1.1 Importance of purity and quality of exosomes 


Since their first discovery, exosomes have attracted the attention of many researchers, 
increasing the size of researchers in the EVs field. The origin of exosomes and some of their 
uses were described in Chapter 1. There are extensive further applications that are beyond the 
scope of this thesis. However, these additional uses have been extensively reviewed? !^?1?, 

The previous chapters explained that EVs, including exosomes, can be found in many 
biological fluids and tissue samples. In order to claim the presence of EVs, the isolate must be 
an extracellular fluid, which in this case can be either a biological fluid or conditioned cell 
culture medium. It is important that collection of EVs, including exosomes, should be 
"optimal" - designed to avoid any mechanical disruption. Despite the potential of exosomes, 
there are many obstacles that limit their application. Although methods of exosome isolation 
are generally easy, current techniques аге costly and only obtain low yields and purity?2022!, 
Today, several techniques have been offered for isolation, characterization, and analysis of the 
functional properties of exosomes that will be described later in this chapter. 

Application of exosomes requires an ability to differentiate them from other subtypes of 
EVs. A motivation for developing guidelines for studies involving exosomes was based on 


ensuring reproducibility. Therefore a list of minimal information for studies of extracellular 


vesicles (MISEV) was published in 2014 and was based on expertise of International Society 
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of Extracellular Vesicles (ISEV) board members actively involved in exosome research 227, 
This document provided basic, yet critical, information, such as methods of isolation and 
characterization. With rapidly growing interest in this research area, many uncertainties and 
issues seem to appear quickly and require further clarification, especially with respect to 
exosomes. After a few years, an updated document — MISEV 2018, was produced by the EV 
community (7380 authors)’. The updated minimal requirement summarized, updated, and 
clarified the most important points from MISEV2014. 

Further, the requirements based on mammalian EVs were corrected and clarified on pre- 
analytical phase. That included specification on detailed culture and harvesting condition (e.g., 
passage number, density, and culture volume), medium preparation (e.g., exosomes-depleted 
serum), storage, separation and concentration, quantification including total proteins, lipids, 
and RNA. The conclusion of the minimal requirements updates recommended use of the 
preferred generic term "extracellular vesicles" unless the vesicles are defined by biochemical 
characteristics matching a particular subtype. According to MISEV 20189? , an EV will be 
called an exosome only after showing “at least three positive markers of EVs, including at least 
one cytosolic protein and transmembrane/lipid-bound protein; at least one negative marker; 
and characterized by two different but complimentary techniques (for example, transmission 
electron microscopy (TEM) and dynamic light scattering (DLS))". In addition, isolated EVs 
have to be evaluated for their purity as non-EV structures (e.g., lipoproteins??) can be present 
potentially interfering with characterization. For example, to be characterized as exosomes, 
proteins associated with Golgi apparatus, endoplasmic reticulum, mitochondria, 
autophagosomes or nuclear components must not be expected as they are present in larger 
EVs*®, 

Methodological approaches for isolation and quantification of exosomes, their proteomic 
analysis and finally detection are critical for choosing a biomarker for diagnostic purposes and 


will be discussed in the next subchapters. 


2.2 LITERATURE REVIEW OF EXOSOME ISOLATION METHODS 
2.2.1 Differential ultracentrifugation (UC) 


The most common method of exosomes isolation is differential ultracentrifugation (UC). 
UC has been used for decades and still remains as a standard method of isolation324-326_ This 


widely employed technique??? has the advantage of minimal sample manipulation. The 
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collected source of biological fluid сап be simply transferred to ап appropriate centrifugation 
tube shortly after collection and proceed following a chain of sequential steps with increasing 
centrifugation force as well as duration. This method is applicable for large sample volumes, 
such as conditioned cell culture medium, as small samples will result in а luck of purity330331. 

However, the efficiency of the method is lacking the purity of vesicles, not even 
mentioning being time-consuming and labor-intensive. Livshits et al? present a very 
interesting theoretical analysis of exosomes isolation by differential centrifugation. They base 
his conception on the type of rotor that is used in common isolation protocol and the importance 
of "cut-off" sizes for vesicles. His analytical explanation is based on the theory that smaller 
particles, which presence are fewer compared to larger particles, may finish their migration 
together with larger particles. The issue with that is that the acceleration force (which governs 
migration rate) is dependent on particle size, with larger vesicles being subject to greater forces, 
and so is offset by larger vesicles being retarded more by the ultracentrifugation medium. 
Another obstacle is co-precipitated aggregates, lipoproteins, and contaminants??, Other 
research groups reported that even with a prolonged UC cycle including overnight??, 
impurities were still observed іп exosome samples**+. Théry et al? suggested to dilute the 


sample with an equal volume of the buffer (1x PBS) if the original sample has a high viscosity. 


2.2.2 Size exclusion chromatography (SEC) 


Size exclusion chromatography (SEC) is equally often used, and preferable to UC as it 
allows separation of exosomes while keeping their original shape**>. This method allows rapid 
and gentle isolation of EVs based on their size by using prepacked columns with porous bead 
resins335. The order of exosome elution from the column depends on their hydrodynamic 
radius. This method has several advantages, such as being less time consuming and needing 
less sample volume than in UC method. Vesicles isolated by SEC also experience minimal 
damage and deformation and do not require specialized equipment and space to conduct the 
process?*6, Further, SEC is applicable for isolation of exosomes from a variety of biological 
fluids. Unfortunately, SEC leaves the purified exosomes in a diluted state, necessitating further 
manipulation for some applications??". For example, while it always works for cell culture 
conditioned medium, it seems that the presence of lipoproteins in EVs samples remains 
challenging for samples from human plasma??". Several attempts optimizing the SEC method 
for human plasma included plasma dilution to reduce the viscosity and prolonged cycles of 


UC*8, optimization the CCM by adapting it to a reduced FBS or FBS-free medium™, and 
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finally switching to exosomes-depleted FBS339340. In addition, Guerreiro et a^! proposed that 
using ultrafiltration prior to SEC allows to separate exosomes from many of the smaller 
proteins іп the sample. A study by Davis et al? also confirms that using SEC method helps to 


avoid unnecessary impurities and best suited of all commonly applied isolation techniques. 


2.2.3 Density gradient centrifugation 


Density gradient centrifugation is mostly used after a crude EV population had been 
isolated?*, but is also applicable for viruses and lipoproteins. Previously, it has been mentioned 
that UC causes deformation and even breakage of vesicles and accumulation of other 
aggregates during the process. Density gradient centrifugation is highly effective in separation 
of vesicles without deformation, even though the gradient undergoes ultracentrifugation for a 
longer duration 16 h?** or 24 Һ55 and at higher centrifugal force (210,000xg) than UC. 

The medium for density gradient isolation frequently includes a sucrose or iodixanol 
(OptiPrep) gradient. According to Chen et al*“°, sucrose gradients achieves high level of purity 
that other methods are not able to provide. This method follows the same steps as standard 
ultracentrifugation, but with an addition of the sucrose cushion in the last step. That being said, 
there are several disadvantages. For example, this method is not suitable for low sample 
volumes as it requires several steps of centrifugation and can occasionally cause breakage of 
уеѕіс1е5227347; moreover, it is a time-consuming method as the last step includes 24 h of 
ultracentrifugation??! that can prolong isolation for 2 days. Another disadvantage is a high 
cost327.348 that prevents many research group even to consider it. In addition, it is not suitable 
for downstream applications (protein and RNA profiling?^) and requires a verification of the 
absence of co-isolated materials??? 

The outcome of using density gradient separation over other methods often prevailed in 


351. However, it has been 


plasma-derived exosomes based on their size similar to lipoproteins 
shown that combining density gradient centrifugation with other purification methods leads to 


greater degrees of exosome purification?*. 


2.2.4. Immunoaffinity capture 


This method uses magnetic microbeads (streptavidin-modified are most common) with 
which exosomes get incubated with. Briefly, after incubation with magnetic beads, they 


recover by rinsing a column or magnetically separating exosomes microbead complex, washing 
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them in 1XPBS solution, centrifuging at the similar force as final step for UC method, eluting 
with a special buffer or even lysing with buffer for electrophoresis and further analysis??? 
Challenges in this method show the importance of establishing the correct ratio between 
antibodies, exosomes, and beads in characterization of melanoma cell-derived exosome*”’. 
However, as it shown in this study, it is possible to isolate specific exosomes based on a defined 
target, leaving unwanted vesicles aside. Other studies also confirm the immunoaffinity capture 
method allows to isolate subpopulation of EVs without contamination and with “retained 
biological activities”?5*355, The same studies proposed that the immunoaffinity method coupled 
with other techniques involving physical separation or other isolation procedures gives a high 
purity of the final exosome elution. At the end, application of this method allows to eliminate 


large aggregates including unwanted proteins and even lipids??6. 


2.2.5 Filtration-based techniques 


Filtration techniques usually play an additional role to the other methods described in 
this section and is not used as a main method of isolation. Filtration is based on separation of 
vesicles based on their size or molecular weight, eliminating particles that are not included in 
the size-range of the vesicles of interest. Filtration can be used at any stage of isolation. For 
example, in the beginning of the process using media enriched in vesicles to remove larger 
vesicles and cell debris, and at the final step to ensure the presence of a particular size of vesicle 
in the solution357. Most often, filtration accompanies UC (рІаѕта?°%, uterine aspirates359, 
urine*™, and other different biological fluids?!) and SEC (CCM5336362-364 and plasma?^!). 

Membranes with pore diameters of 0.8, 0.45, 0.22, and 0.1 um are most common in the 
isolation process??. Application of filtration method for exosomes isolation alone is still 
remaining challenging. Very often, vesicles get stuck to the membrane filter which results in 
their 1055260 causing filter plugging and deformation of filters335357360 Some proteins also 
adhere to the membrane, which results in loosing potential proteomic information*®. 

Functional studies by Benedikter et a? have introduced a 0.22 um filtration step aiming 
to concentrate and purify small EVs (sEVs). The same study also concluded that using 10 kDa 
filters show completely recovery of vesicles at a much higher concentration that when using 
100 kDa filters, where recovery was only 409679539. On the other hand, several studies reported 
challenges where isolation of exosomes using a filtration method was not efficient. For 
example, the cryo-SEM microscopy analysis conducted by Heath et al*°° showed lipid droplets 


and other large contaminants combined with cell-derived EVs samples isolated by tangential 
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flow filtration. In contrast, another research group confirmed highly purified exosomes could 
be isolated by sequential filtration, which generated exosomes free from protein aggregates333, 
A notable advantage of tangential flow filtration is that it works well for large volume samples 


as has been reported by Buscatto et aP*", 


2.2.6 Precipitation-based techniques 


Precipitation methods have been designed for their simplicity. They usually include 
adding a solution that contains an organic solvent such as polyethylene glycol (PEG) or 
acetone, then quick centrifugation at low speed (3,000-5 ,000х е), and a wash with 70% ethanol. 
Garcia-Romero et al found exosome isolation by PEG is a quick and low cost technique that 
provides high yield of EVs suitable for future biomarker discovery. A two-step PEG- 
precipitation was found to more successfully purify exosomes than other methods?9?, Rider et 
al? suggested that a higher PEG concentration than is conventionally used can increase 
precipitation of smaller vesicles and is therefore useful for isolation of exosomes. Further, the 
experiment was done on different biological fluids including plasma, saliva, urine, and CSF 
showing not only a high yield with retention of biological activities. 

Interestingly, another research group?! suggested that precipitation methods by PEG and 
PROSPR (Protein Organic Solvent PRecipitation) are ideal for biomarker discovery and 
specifically for therapeutic uses as they are both capable of removing abundant proteins from 
the sample. However, the same study reported that they were unable to image exosomes 
isolated by the PEG method. An attempt to isolate exosomes by PEG followed by washing and 


pelleting by UC was performed by Ludwig et a6? 


on both small and large scales. The goal of 
their research work was to test the efficacy of the PEG method in concentrating sEVs. In order 
to prove the efficiency of this method, further investigation in detection of residual PEG 
particles in the final samples was performed by barium-iodide staining techniques. As a result, 
it had been confirmed that all PEG residues were washed out making this technique suitable 
for sEVs enrichment. Despite retaining their functional activities after PEG precipitation, 
those sEVs are not recommended for molecular analysis due to their low purity. In addition, 
the same study concluded that PEG precipitation is unable to separate different type 
of sEVs within the sample. It has been previously reported that molecular weight of PEG can 
affect the precipitation process??? 


1873 


A recent report by Gallart-Palau et al?” presented an experimental workflow for isolating 


enriched EVs from the CNS using PROSPR method. According to their results, the percentage 
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ofisolated EVs revealed that somewhere around 80% of observed vesicles were exosomes and 
75% of identified proteins reported to be associated directly with exosomes. Their results 
confirm that PROSPR method is applicable to exosome isolation and can be easily and 
inexpensively performed and offers potential in clinical settings. On the other hand, there are 
a few limitations. Using organic solvents may interfere with the functional properties of EVs 


causing their fusion37! 


and won't be suitable for a large sample volume. Although several 
studies have proven that PROSPR can be successfully used in exosome isolation and 
specifically with an increased concentration of exosome proteins. However, identifying tumor- 
specific proteins still remains challenging and application of PROSPR to this end needs further 
validation’. 

Charge-based precipitation coupled with PEG confirmed successful recovery of vesicles 
from small volume samples when tested on different biological fluids including saliva and 
human liver stem cells374. This study used various doses of protamine to induce exosomes 
isolation and evaluate the ability of EVs to retain their biological activities after isolation. 
Another research group led by Klymiuk et al>”> that applied the same method, but on an equine 
mesenchymal stem cell line reported vesicles aggregation, which didn’t seem to be an issue in 
a previous study. 

A novel “salting-out” technique proposed by Brownlee et al?” works by mixing the 
sample with 0.1 M acetate buffer at pH 7.4 allows to rapid isolation fast without specialized 
equipment. Their study confirmed that salted-out EVs are indistinguishable from those 
obtained by well-known and preferable method ultracentrifugation. The efficiency of this 


1377 and Sáenz-Cuesta et а/?75, However, vesicle 


method was confirmed by Kotmakci et a 
aggregation, contamination with non-EV proteins, the presence of polymers, and the need for 
an extra purification step are limitations of this method??997, 

The one-step precipitation method from different biological fluids is also used in several 
commercial EV isolation kits, such as ExoQuick (System Biosciences), Exo-Spin (Cell 


Guidance Systems), Total Exosome Isolation (Invitrogen), and ExoPrep (HansaBioMed). 


2.2.7 Microfluidic technologies 


Microfluidic methods are relatively fastand efficient, but costly and require 


87. 


special equipment These methods operate on a micrometre scale and use combined 


strategies that allow efficient isolation. Currently, there are several devices that can be coupled 


45 


381,382 


with as electrophoresis (blood and plasma samples)**°, magnetism , applied acoustics?5? 


(whole blood), and immunoaffinity?**, 

A microfluidic system with an integrated acoustic-fluidic device is able to separate 
exosomes from other vesicles based on their size, as confirmed by nanoparticle tracking 
analysis (NTA)383, It requires as little as 50 uL of ће sample*®. In addition, the same study 
was able to demonstrate the removal of unwanted blood components, such as red and white 
blood cells. Another study that used an acoustic nanofilters was based on "in-flow size 
fractionation”. It reported 80% exosome recovery rate from CCM and red blood cells?55, 
Successful coupling of immunoaffinity capture and electrophoresis was shown by 


1286 who were searching for non-invasive biomarkers in cancer. Cho et aP*7 


Vaidyanathan et a 
discussed the concept of electro-mitigation; however this attempt was not partially successful 
for exosome isolation, but resulted in 6596 recovery of EVs. The use of microfluidic devices 
for liquid biopsies, including the isolation of exosomes, and their role іп capturing and 
detecting biomarkers has been recently reviewed??? 

“Lab-on-a-chip” methods use small sample volumes and can typically be carried using 
less hands-on time. This approach has proven successful in working with blood plasma 
samples, naming high viscosity апа contamination with non-specific proteins аз 


downside effects??. 


2.2.8 Commercial isolation kits 


A large proportion of exosomes are lost in the process of purification, especially during 
separation from other subtypes of EVs, proteins, and cell debris. To combat this problem, 
commercial exosome isolation kits that take less time and provide high purity samples can be 
used. 

There are several kits available on the market, including ExoEasy Maxi Kit (Qiagen), 
Pure Exo exosomes isolation kit (101Bio), and Exosomes Purification kit (Norgen Biotek). 
Most of them are very easy to work with and do not require specific equipment. The basic 
principle of commercials kits is sedimentation by using a polymeric agent. Depending on the 
kit, isolation can usually be done in a short time (within an hour) and following simple 


procedures??? 


. The formation of pellets containing exosomes is achieved by mixing the sample 
with a provided solution and then centrifuging on low speed (700-1500x g). Current challenges 
for most kits are their cost, low protein yield, sample volume dependency, and inability to 


differentiate sub-classes of EVs?7?99». The major drawback for commercial kits is that the 
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polymeric reagent?? that can affect characterization of exosomes and their biological activities 
due to contamination with residual matrix??!, non-vesicular molecules*”, and other impurities 


associated with commercial kits???. 


2.3 LITERATURE REVIEW ON EXOSOME CHARACTERIZATION 


2.3.1 Dynamic light scattering (DLS), nanoparticle tracking analysis (NTA), and 
Tunable Resistive Pulse Sensing (TRPS). 


Three common methods for characterizing exosomes (such as counting and sizing) are 
DLS, NTA, and TRPS. As DLS was used extensively in this thesis, an overview of this 
technique included below. A brief description of other two techniques (NTA and TRPS) is also 
included for comparative purposes. 

DLS can be used to determining the size distribution of exosomes and microvesicles in 
a fluid (Fig. 4). The sample is illuminated at a fixed angle and changes in the subsequent 
scattering pattern are monitored. Particles move randomly through the solution following 
Brownian motion. The movement velocity depends on the hydrodynamic diameter of the 


particle, the temperature and viscosity. 
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Figure 4. Dynamic light scattering (DLS) with example of analysis of EVs 22393, 


This Brownian motion causes changes in light scattering when a laser passes through the 
solution, proportional to the hydrodynamic size of the solutes. A diffusion coefficient of 
particles (D t) can be used to determine from the data, and from this the hydrodynamic radius 


of the particles can be determined using the Stokes-Einstein’s equation (eq 1): 


D.=ksT/6mR (eq 1) 


where kg is the Boltzmann coefficient, Т is the temperature, 7 is the viscosity, and ғ is the 


radius of the vesicle??*. One of the biggest advantages of DLS is that it can measure particles 
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ranging in size from 500 nm to 6 um, but it is predominantly useful for determining particle 
size when the samples contains monodisperse particles??5. DLS struggles to calculate accurate 
particles sizes in polydispersed systems??6, due to the larger particles leading to a greater degree 
of scattering, making detection of small particles very problematic397398. 

Several studies use DLS to characterize EVs from cell culture and plasma‘, ovarian 
cancer cells???, and pre- and post-filtration frozen plasma'?. Overall, DLS is useful in 
providing the diameter range of EVs. However, no other information such as morphology, 
subtype or other biochemical information cannot be determined by this method. Unlike DLS, 


NTA is an image-based method although it follows the same Brownian principles (Fig. 5). 
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Figure 5. Nanoparticle tracking analysis (NTA) with example of analysis of EVs 292396. 


NTA allows characterization of vesicles on a single-particle level, which is critically 
important as EVs аге known to be heterogeneous in size*?!, The sample preparation is as easy 
as for DLS and samples can be recovered after measurements. When preparing the sample, it 
is important to follow the protocol and bring the sample to correct concentration, viscosity, and 
dilutions. The recommended working concentration of EVs should be in the range of 2x 108 - 
20x 108/mL*°?3, In addition, refractive index (RI) should be considered404405 as it can 
influence detection limits along with pre-analytical conditions (preparation and storage)*”?. 
Gardiner et al^ has identified RI for the variety of EVs in including urinary (fresh and 
lyophilized), neuroblastoma, blood, activated platelets, and placental (small and large) 
concluding that the values become much smaller as diameter of vesicles decreases. 
Other challenges might include air bubbles, temperature changes, and laser alignment ^06407, 

The tunable resistive pulse sensing (TRPS) method (Fig. 6) is considered as one of the 


major techniques for EV characterization, including counting and sizing 90405409 
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Figure 6. TRPS (qNano, Izon) system. (A) The Izon qNano unit, (B) cross-shaped nanopore including the elastic membrane 
in the center of which exosomes will be applied (the size of pore can vary depends on the vesicles size), and (C) the cross- 
section of a pore with example of analysis of ЕРУ 041 

Before measurements, samples of EVs are diluted in a conductive electrolyte buffer and 
then applied in the middle of a membrane pore that is located on the top of fluid cell. EVs are 
measured as a result of a change in the electrical resistance when they move across the pore of 
the membrane. Each vesicle produces a resistive pulse that is used to measure the size of an 
individual vesicle. The vesicle concentration is proportional to the count of resistive pulses per 
minute. An advantage of TRPS is its ability to characterize a polydisperse sample, providing 
distinct peaks for each different size group. A disadvantage of this techniques is that larger EVs 


can cause blockage of membrane рогев 95, 


2.3.2 Microscopy-based characterization 


Transmission electron microscopy (TEM) is commonly used for the imaging of EVs and 


can give an idea of the structure and morphology?954^'? 


. This technique has been extensively 
used in this thesis to validate morphology of exosomes and used in combination with 
immunoblotting that will be described later. Electron microscopy uses electrons that pass 
through the sample to form an image (Fig. 7). The image forms when the transmitted or the 
unscattered electrons hit a fluorescent screen that produces an image based on the density of the 
material. This image is then photographed by a camera. TEM needs to be done in vacuum as the 
electrons can be easily deflected. Additionally, the sample cannot be aqueous and needs to be 
dehydrated as imaging performed under vacuum conditions. This can affect structure and 
morphology*”. To prevent that, the sample can be fixed with 4% paraformaldehyde (РЕА) 
solution to preserve the original rounded shape?. Since EVs are composed of low electron 
dense atoms (for example, carbon), TEM of exosomes has decreased contrast of exosomes 
when compared to metal nanoparticles. For this reason usually a negative staining method (296 
uranyl acetate solution) is used??9^?, Immunolabeling techniques using gold nanoparticles 


(AuNP) conjugated with specific antibodies can be used to classify specific membrane proteins 
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expression on EVs surface*!^*5, The first study where the size distribution of the vesicles were 


reported at the single EV level was successfully conducted by Brisson et al*'^ 


. Disadvantages 
of this method might include imaging under a non-native environment, some artefacts related 
to staining and sample storage, and vesicles deformation due to their dehydration*!®. Another 
point of disadvantages are long imaging hours, requirements of highly trained personnel, and 


sample preparation. 
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Figure 7. Transmission electron microscopy (TEM) and cryo-scanning electron microscopy (cryo-SEM) principle with 
393,396,417 


example of analysis of E Vs ° 

Cryo-scanning electron microscopy (cryo-SEM) allows imaging samples in their own 
environment as sample is being flash frozen and viewed below -100°С (Fig. 7). This method, 
similarly to TEM, was used in this thesis to characterize morphology of exosomes, and was 
especially useful in assessing the purity of exosomes. This method is considered as one of the 
supplemental sources in confirming size and morphology of EVs. Unlike TEM, cryo-SEM 
does not have special procedures for sample preparation such as embedding and dehydrating. 
The sample simply gets immobilized by cooling it in liquid nitrogen. Cryo-SEM microscopy 
allows for superior quality images that confirms the rounded shape of vesicles, as has been 
reported by several research groups*!*?!, There has been a notable increase in use of cryo- 
microscopy, and it has even become a preferred method for examining morphology of vesicles 


as it is free of the dehydration-derived artefacts that can be found with TEM*®. In addition, а 


422 419,423 


few groups reported that different subcategories of EVs*** and multilayered vesicles can 
be also identified and detected. The diameter of vesicles analyzed by cryo-SEM can be slightly 
larger than TEM reported?^. This сап be explained samples remain hydrated during the 
imaging process and not desiccated as in TEM sample preparation, leading to liquid expansion 
when being cooled by liquid nitrogen. Challenges will included presence of frozen artefacts, 
low signal-to-noise ratio due to the sample thickness, sample preparation and operating skills 


on ТЕМ4!6425. Like TEM, cryo-SEM can be combined with immunoblotting with AuNP, 
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which has been successfully done by using EVs from different sources: embryo-endometrial426, 


373,427 414 


plasma , and platelets 
Atomic force microscopy is a well-known method for analysis at the nano scale that 
works by recording interactions between the probing tip and the surface of the sample (Fig. 8). 


It can be used іп EV imaging and has been successfully reported in several 5ішдіев 25,429, 
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Figure 8. Atomic force microscopy principle with example of analysis of EVs 99429. 


Confocal immunofluorescent microscopy is a broadly used technique that is used to 
provide a high-resolution three-dimensional image with an ability to focus on a thin layer in 
the sample of interest. Frequently samples are labeled (via direct 430-42 and indirect 453454 
approaches) with a specific antibody conjugated to a fluorescent dye as shown in Figure 9. 
Briefly, a laser is focused on an objective lens after being reflected by a dichroic mirror. Once 
light excites the fluorophore dye it emits at a different wavelength and travels back through the 


objective lens and mirror creating an image (Fig. 9). 
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Figure 9. Confocal microscopy principle with example of analysis of exosomes stained with Cy5 (red). Cell membrane is 
stained with PTK-7 (green)'?**955. 
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Labeling of EVs does not always require use of antibodies. For example, а research study 
of the cellular uptake of prostate cancer-derived EVs led by Roberts-Dalton et al*? reported 
that a novel thiol-based indirect labeling method allows imaging EVs over 6 h without 
compromising EVs’ integrity and fully retaining their biological function in inducing 
differentiation of fibroblasts. Another successful example of indirect fluorescent labeling was 
described by Gangadaran et al? by using Renilla luciferase (Rluc) dye іп EVs derived from 
thyroid cancer cells in mice. Visualization by fluorescence microscopy can be very useful in 


preparation for flow cytometry experiments, as will be briefly described below. 


2.3.3 Flow cytometry 
The principle of flow cytometry based on identifying particles in suspension passing 
through a laser beam collecting fluorescence from specific labels as well as forward and side 


scattered light. The data from the latter can be translated into particle size438439 (Fig. 10). 


liquid flow 
5 
ш 
laser, 8 
— Ф 
fluorescence E 
studied particle: 
scattered cell or molecule 
light 


Forward scatter 


Figure 10. Flow cytometry principle!!*3%°, BAL- bronchoalveolar lavage 


Currently, flow cytometry is widely used in analyzing EVs (300-500 nm in size396440- 


442) and their components“? 


allowing collection of information about particles’ phenotype, 
such as their number and ві2е2%, However, several studies reported detection of EVs іп a size 
of 200 пт“, 280 nm445446 and 160-200 nm^^^^^5, Historically it has been notoriously difficult 
when applied to EVs due to their size, heterogeneity, and previous lack of markers that could 
distinguish a specific population of interest“. Conjugation of EVs with microspheres 
(polystyrene-organosilicate  composites*?, streptavidin-coated particles^!, fluorescent 
multiplex beads ^5545?, latex beads 53454 and silica microspheres*”°) seems to be able resolve 


this issue. Application of conjugated EVs has been shown in characterization of serum 


exosomes of patients with pancreatic cancer? and EVs isolated from normal and pre- 
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eclampsia conditions in pregnant отеп“. A new algorithm based on Coulter principle^? has 


been proposed for distinguishing between EVs and cells, cell debris, and other aggregates ^^". 


2.3.4 Western blot 


Western blotting is one of the most widely applied bulk analysis techniques for EVs 
analysis (Fig. 11)5456. WB techniques are routinely used to detect a specific protein of interest 


extracted from cells or other sources, including exosomes??. 
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Figure 11. Western blotting technique 39457, 


Besides detecting and visualizing protein on a gel, it can also give a relative quantification 
of a protein in the sample. The process of blotting takes a few days as it includes several steps 
(protein separation by gel electrophoresis, their transfer to a membrane, and detection by 
antibodies). Each step is equally important. It is a very meticulous technique that combines the 


resolution of polyacrylamide gel electrophoresis with the specificity of immunoassays. 


2.3.5 Liquid chromatography-tandem mass spectrometry (LC-MS/MS) 


Liquid chromatography-tandem mass spectrometry (LS-MS/MS) has been frequently 
used in clinical studies allowing collection of data from protein analysis to identify biomarkers 


for various diseases*5® 


. This technique is very useful in applications if the sample is complex. 
Therefore, it can be a great asset for characterization of EVs. Separating analytes based on their 


physicochemical properties allows identification of their unique mass spectra (Fig. 12). 
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Figure 12. Principle of LC-MS/MS 4%, 


A sample for LC-MS/MS needs to be free from contamination with components that 
interfere with either the chromatographic separation or the mass spectrometry. This method of 


characterization will be described in Chapter 3. 


2.4 METHODS 
2.4.1 Cell culture 


A549 cells (ATCC®), derived from lung carcinoma tissue, were cultured in Dulbecco’s 
modified Eagle’s medium with 10% heat inactivated fetal bovine serum (FBS) (Sigma, St. 
Louis, MO, USA) and 1% Penicillin (Gibco™, USA). After reaching 80% confluency, cells 
were washed with phosphate buffered saline (PBS) with a final concentration of 0.01 M 
(composition NaCl-137 mM, KCL-2.7 mM, МазНРО4-10 mM) (later referred as 1xPBS), 
trypsinized, and collected by centrifugation (300xg for 5 min). Subcultures were made at a 
ratio of 1:3 ina 5% CO» humidified atmosphere at 37 °C for 2 days. After this time, the medium 
was replaced with fresh medium (washing, subculturing, and centrifugation steps were 


omitted) with a final concentration of 5% heat inactivated FBS for 24 h, and then bathed іп 
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medium containing 2% exosome depleted FBS (GibcoTM, USA) for 12 h to eliminate the risks 
associated with FBS background. The exosome-rich conditioned cell culture medium (CCM) 
was then collected to proceed with exosome isolation. 

WI-38 cells (ATCC®), derived from normal embryonic (3 months gestation) lung tissue, 
were cultured in Eagle's minimum essential medium with 10% heat inactivated FBS (Sigma, 
St. Louis, MO, USA) and 1% Penicillin (GibcoTM, USA). Subculturing and further steps were 
carried out as described above, except using medium and subculturing was undertaken using a 


ratio of 1:4. 


2.4.2 Collection of breath-derived exosomes 


To standardize EBC, no food was taken for 3 h prior to collection. Samples were 
collected from a single individual for three days. EBC was collected using a chilled tube 
assembly consisting of a mouthpiece through which breath was exhaled and a 15 mL , 
collection vessel. The subject exhaled for approximately 60 min, with condensate collected in 
a tube directly maintained at 0 °C. After the third collection, samples were pooled together. 
Collected EBC was filtered through 0.22 um MF-Millipore MCE membrane (Millex-GS, 
Merck Millipore Ltd., UK). The collected EBC was kept in low protein-binding tubes and used 
within 3 days of preparation. The ethical application was not required as the leading researcher 
used its own breath. Three biological replicates were collected from the same individual. Each 
biological replicate was collected within three days. Each of these biological replicates was run 
in two technical replicates for proteomic analysis (Ch. 3) and in two technical replicates in 


western blotting. The total volume of each biological replicate was 36 mL. 


2.4.3 Exosomes isolation by differential ultracentrifugation (UC) 


To obtain CCM, cells were seeded at initial density 1.9x10° cells/mL reaching 
confluency 8.3x10° cells per Т-75. Exosomes were isolated as described?" with an additional 
step after the first centrifugation. Briefly, CCM from 3.7х 10’ cells first were centrifuged for 5 
min at 300xg to pellet dead cells. Then for 20 min at 700xg, followed by 10 min centrifugation 
at 2,000xg to pellet cellular debris, 10 min centrifugation at 10,000xg and run through the 
0.45 um filter (Millipore, Millex-GP, gamma sterilized). Then CCM was concentrated by using 
ultrafiltration centrifugal concentrators with a molecular weight cut off 10 kDa (Pierce, protein 


concentrators, PES, 10 K MWCO). Recovered supernatant was collected from each step and 
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transferred to a new tube before taking the next centrifugation step. Then supernatant was 
centrifuged at 10,000xg for 30 minutes and recovered exosomes-rich supernatant was 
transferred to a clean polycarbonate bottle with cap assembly, 10.4 mL, 16x76 mm (Beckman 
Coulter, cat. 355603) and undergone 90 min spin at 100,000xg (Beckman Coulter Optima L- 
100 XP Ultracentrifuge) using type 70.1 Ti rotor, fixed angle, titanium, 12x13.5 mL rotor 
(Beckman Coulter Life Sciences). Then supernatant was decanted and exosomes were re- 
suspended in cold fresh Phosphate Buffered Saline (PBS) with a final concentration of 0.01 M 
((composition NaCl -137 mM, KCl- 2.7 mM, Na;HPO,- 10 mM) (later labeled as 1xPBS), pH 
7.4 (Thermo Fisher)). Then exosomes were centrifuged again for another 60 min at 100,000xg. 
This step was included as a washing step from any medium debris that could stick to exosomes. 
After the final step, supernatant was poured off and pellets (exosomes) were re-suspended in 
100 uL of cold 1XPBS, pH 7.4 and stored at 4 °C for future experiments. Protein concentration 
in the breath exosomes sample was measured by using a micro-BCA assay (BCA Rapid Gold 
BCA assay kit). 


2.4.4 Exosomes isolation by size exclusion chromatography (SEC) 


Exosomes were isolated following standard methodology??? modified as followed. First, 
CCM??549? underwent 5 min centrifugation at 300xg, then 20 min at 700xg, and finally at 
10,000xg for 10 min. The collected supernatant was filtered through a 0.45 um syringe filter 
(Millipore, Millex-GP, gamma sterilized) in order to separate microvesicles and remove 
unnecessary proteins and other biological materials. Then the volume of CCM was reduced to 
0.5 mL using a centrifugal filter device protein concentrator with a molecular-weight cut-off 
of 10 kDa (ThermoFisher, Pierce Protein Concentrators PES, 10 K MWCO, 0.5 mL). A 
qEVoriginal column (Izon Science) was equilibrated with ІХРВ5, pH 7.4 (phosphate buffered 
saline) buffer that was freshly filtered through a 0.22 um (Millipore, Millex-GP, gamma 
sterilized). The first 5 fractions were voided (0.5 mL for each fraction) and fractions 6-10 were 
collected individually. Fresh isolated exosomes were dispersed in the same equilibration 
buffer. According to Izon Science, the peak of exosomes elution should be in fraction 8. That 
was confirmed later by TEM, DLS, and western blotting. Following use, the column was 


washed with 40-60 mL of the filtered buffer and stored vertically at 4 °C. 
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2.4.5 Preparation biological samples for transmission electron microscopy (ТЕМ) 


The exosome sample (4 uL) was mixed with 8% paraformaldehyde in equal parts (1:1 
ratio, v/v) and dropped on a formvar/heavy carbon coated (300 mesh) grid (GSCU300CH-50, 
ProSciTech) that was previously plasma cleaned for 10 min. The sample was left for 10-15 
min to dry, followed by 2 washing cycles with Milli-Q water. After, the sample was stained 
with 4% uranyl acetate for 15 min in the dark. The grid was washed again with Milli-Q water 
and air dried. Before imaging the sample, the grid was plasma cleaned for 10 min. Images 


were analyzed on Jeol 2100F using Gatan Microscopy Suite Software 3.0. 


2.4.6 Synthesis of gold nanoparticles (AuNP) 


AuNPs were synthesized according to the protocol described by Liu and Lu**!. First 
fresh aqua regia was made by mixing HCl and HNO; in a 3:1 ratio and then used to soak two- 
neck 200-mL flasks, stopper, condenser, and magnetic stir bar for 25 min. Then all parts were 
rinsed several times in sterile deionized water. Secondly, Milli-Q water (98 mL) and 50 mM 
HAuCl, (2 mL) were added to а two-neck flask bringing the final concentration to 1 mM. The 
flask was then placed on a hot plate. When the solution began to reflux, the stopper was 
removed, and 38.8 mM sodium citrate (10 mL) was added resulting in color change from 
yellow to deep red. The solution was left to reflux for another 20 min under stirring. Once the 
solution was cooled down at RT, the diameter of nanoparticles was around 15 nm as measured 
by DLS. The size of AuNPs was further confirmed by TEM with defined size range of 18-20 
nm. The magnitude of charge repulsion “Ç potential” was then measured by using DLS with a 


result of -35 mV. 


2.4.7 Immunolabeling exosomes for TEM 


Immunolabeling of exosomes was performed as described by Jung and Mun*'? 


with a 
few differences. A sample (4 uL) of freshly isolated EVs were dropped оп a formvar/heavy 
carbon coated (300 mesh) grid (GSCU300CH-50, ProSciTech) that was previously plasma 
cleaned for 10 min and left to dry for about 20-25 min at room temperature (RT). An aliquot 
(5 uL) of primary antibody cocktail (anti-CD63 (mouse monoclonal (TS63), ab59479, Abcam) 
and anti-CD81 (mouse monoclonal (M38), ab79559, Abcam) tetraspanin general markers with 
a dilution ratio of 1:100 in 1XPBS containing 1% Bovine Serum Albumin (BSA), pH 7.4 was 


placed onto the exosomes. Sample were left overnight in a plastic container with a closed lid 
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secured with parafilm. Next day, the sample was washed in blocking buffer (1xPBS containing 
1% BSA, pH 7.4) three times for 5 min each. Gold nanoparticles (AuNPs) were synthesized 
according to the method of Liu and Lu^6! described above in 2.4.6 and conjugated to the 
secondary antibody (m-IgGx ВР-НЕР, sc-516102, Santa Cruz Biotechnology, INC) as 
previously described*!>. The grid was incubated with AuNP-conjugated secondary antibody (5 
uL) diluted to a ratio of 1:500 in 1XPBS containing 1% BSA, pH 7.4 for 1 hour at RT ina 
sealed container and secured with parafilm to prevent evaporation. The grid then was washed 
5 times with Milli-Q water, air dried, and stained with 4% uranyl acetate іп the dark as 
described above. Before imaging, the sample was plasma cleaned for 10 min. Images were 


analyzed on Jeol 2100F using Gatan Microscopy Suite Software 3.0. 


2.4.8 Cryo-scanning electron microscopy (Cryo-EM) 


Samples were prepared by using the freeze-fracturing technique. The exosome sample (5 
uL) was manually dropped on top of the sample stage and then rapidly cooled and frozen in 
liquid nitrogen. Then the frozen sample was transferred under vacuum to a cold preparation 
chamber (Alto 2500, Gatan) where the sample was mounted on the stage maintained at -140 
9С and fractured to expose interior structural details of the sample. The sample was coated with 
platinum (5-10 nm) and imaged on a cryogenic scanning electron microscope (JSM6500F, 
JEOL) equipped with a 97 uA emission gun. All samples were imaged at a working distance 


of 8.4 mm using low acceleration voltage (10 kV). 


2.4.9 Exosomes purification and characterization by western blot 


Samples from breath (total volume of EBC 36 mL) and CCM from 3.74x 10" cells (both 
cell lines) were assayed using western blotting to confirm that the isolation procedure produced 
purified exosomes. Isolation of exosomes was performed as previously described in 2.4.4, with 
blotting undertaken on samples from pooled fractions 7, 8, and 9 of SEC isolation as well as 
probing individual fractions isolated by use of the same qEV original columns (Izon Science) 
and from ultracentrifugation. Sterile medium free of exosomes and with no FBS added was 
used as negative controls. These negative controls were assayed to ensure that there were no 
contaminating proteins from media or supplements. Samples were suspended in lysis buffer 
(50 mM Tris-HCI pH 8.0, 150 mM NaCl, 1% NP-40) and subjected to two rounds of vigorous 


vortexing (10 s) and cooling on ice (20-30 min). Samples were finally centrifuged at 10,000xg 
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for 10 min. Тһе pellet was discarded, апа the protein concentration of the supernatant was 
determined using a micro-BCA assay (Pierce Rapid Gold BCA Protein Assay Kit) according 
to the manufacturer’s instructions. Samples were mixed with 4x sodium dodecyl sulfate (SDS) 
sample buffer (0.2 М Tris-HCL, 0.4 M dithiothreitol (DTT), 8% SDS (w/v), 6 mM 
Bromophenol blue, 4.3 M glycerol) and heated for 5 min at 95 °C. Samples containing 
approximately 20 ug of protein were loaded on a 10% acrylamide gel, and proteins separated 
by electrophoresis. The western blot was performed according to the protocol described by 
Lim^? with minor amendments: 5% BSA was used as a blocking agent; the membrane was 
probed with anti-CD63 at 1:800 dilution (TS63; Invitrogen, ref. 10628D), anti-CD81 at 1:800 
dilution (M38; Invitrogen, ref.10630D) and secondary mouse monoclonal (goat anti-mouse) 
secondary antibody conjugated to Horseradish Peroxidase at 1:5000 dilution (Santa Cruz, sc- 
2031). Total loaded protein lysate ~ 20 ug/lane for all samples including negative controls. 
Blotting was performed using 2ug/mL of anti-CD63 and anti-CD81. The goat anti-mouse 
secondary antibody was used at a concentration of 0.08 ug/mL. The membrane was visualized 


on the ChemiDoc Amersham Imager 600 (General Electric). 


2.4.10 Dynamic light scattering (DLS) 


The hydrodynamic diameter of freshly isolated exosomes were measured using the 
Zetasizer ZS90 (Malvern Instruments, UK) as previously described*9. All measurements were 
conducted at RT. Polydispersity index values varied in the range of 0.178-0.232. The size 
distribution profile of vesicles was obtained from three biological replicates, with each 
measured in triplicate. Each measurement was obtained from 12 runs following 30 s 


equilibration. 


2.4.11 Statistical analysis 


Analyses were performed using Prism GraphPad scientific 2D graphing and statistical 
software, version 9.1.1 (223) for macOS, April 16, 2021, San Diego, California. Data analysis 
for DLS measurements was completed using Malvern Zetasizer Software 7.12 (Malvern 


Instruments, Malvern, UK). 
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2.5 RESULTS 


To characterize exosomes from EBC it was first necessary to optimize methods for 
isolation of pure, intact, and high-quality exosomes. To this end, exosomes obtained from 
conditioned medium from cultured A549 cells were isolated by two methods: 
ultracentrifugation (UC) and size-exclusion chromatography (SEC). A549 cells are derived 
from a human lung carcinoma and were chosen as a straightforward cell model of lung cancer. 
The WI-38 fibroblast cell line is derived from the lung tissue of a three-month gestational fetus 
and was chosen as a cell model of healthy lung tissue. Both cell lines were used at early 
passages (A549 passage 2 and WI-38 passage 3) and periodically monitored for any 
abnormalities in their morphology ^. Initially, exosomes were isolated by ultracentrifugation. 
This method was chosen for its simplicity and compatibility with large volume samples. Figure 


13 shows cryo-SEM images of the isolated exosomes. 
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Figure 13. Characterization of UC-based isolated cell (A549)-derived exosomes by cryo-SEM. On the figure: (A) other EVs 
and protein aggregates and (B) exosomes isolated from A549 cell culture model. Images are representative of two independent 
experiments. Scale bar 10 um and 100 nm. 


Some of the disadvantages of this method previously described in 2.2.1 can be seeing on 
Figure 13 A, including possible mechanical damage of vesicles and co-isolated protein 
aggregates. However, much more pure samples (Fig. 13 B) were obtained following the 
optimization of UC. This included filtering exosome-enriched cell culture medium through a 
0.45 um filter after the 10,000xg centrifugation step, and pellet removal and re-centrifuging 
the isolated exosome at 100,000xXg for a second time for another 60 min allowing them to wash 
in IxPBS and then pelleted again by centrifugation, bringing the total centrifugation time to 
2.5 h. 

The second method of isolation, SEC, was tested in this study on the cell-derived 


exosomes from the same cell culture model (A549) with the goal of obtaining exosomes that 
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would be higher in purity comparing to UC method and intact after isolation. Figure 14 shows 


cryo-SEM images of the SEC-based isolated exosomes. 
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Figure 14. Characterization of SEC-based isolated cell (A549)-derived exosomes by cryo-SEM. (А) Exosomes from pooled 
fractions 7-9 апа (B) exosomes from pooled fractions 7 and 8 only. Images are representative of two independent experiments. 
Scale bar 1 um and 100 nm. 


Ав expected337, SEC-based isolation (undertaken using Izon Science qEV original 
columns) is highly effective at isolation of pure exosomes, preserving them in their intact shape 
(Fig. 14 A and B) and largely free of the membrane-like plaques or protein aggregates 
associated with the initial UC conditions!!9352 (Fig. 13 A). Asanticipated???, A549 cell-derived 
exosomes eluted from SEC after the column void volume which is represented by the first six 
0.5 mL fractions. Although exosomes can be obtained from subsequent combined size 
exclusion fractions (fractions 7-9)??? (Fig. 14 A), we found that the two fractions (7 and 8) (Fig. 
14 B) immediately following the void volume gave the highest purity exosomes, showing a 
size range (45-100 nm) that is consistent with exosomes by cryo-SEM microscopy (Fig.15) 
and DLS (Fig. 16). 
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Figure 15. Cryo-SEM images of SEC-based isolated cell (A549 and WI-38)-derived exosomes. Images are representative of 
individual fractions (A) 7, (B) 8, and (C) for A549 and (D-E) WI-38 cell-derived exosome for the same fractions respectively. 
Scale bar 100 nm. 
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Figure 16. DLS analysis of cell- and breath-derived exosomes. Size distributions of exosomes for fractions 7-9 isolated 
from A549, EBC, and WI-38. The polydispersity index (PDI) varied in the range of 0.178-0.232. The Y-axis shows intensity 
in percentage, and X-axis corresponds to the size of the vesicles (d. nm). The plots show the mean from all three biological 
replicates performed in three technical replicates and repeated three times. The error bar are not visible due to insufficient 
variability in the data. 


Although the DLS data suggest that vesicles in fraction 9 are similar in size to those from 


the earlier fractions, TEM analysis of individual fractions demonstrated that this fraction 
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contains smaller vesicles in some replicates (Fig. 17) and as a precaution against 
contamination, such later fractions should not be used where high purity exosomes are 


required. 


Figure 17. TEM images of smaller EVs of cellular and breath origin. Images are representative of smaller EVs found in 
fraction 9 from (A) A549 cells, (B) EBC, and (C)WI -38 cells. Scale bar 50 nm. 


The optimized conditions were then applied to isolation of exosomes from WI-38 healthy 
lung tissue cell model and EBC. As can be seen in Figures 15 and 16, the vesicle sizes for each 
isolated fraction are consistent across the cell lines and EBC and are in the 45-100 nm range 


expected for exosomes in fractions 7 and 8 (Fig. 18). 
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Figure 18. Size distribution of cell- and breath-derived exosomes. The distribution of vesicle diameters (d.nm) as 
determined by DLS for SEC fractions isolated vesicles from EBC, WI-38, and A549 cells. The Y-axis corresponds to the 
size of the vesicles (d.nm). Data represents the average of three biological replicates, each evaluated through three 
technical replicates. Error bars represent the standard deviation of the biological replicates. The figure was generated 
by using Prism GraphPad Software (version 9.1.1) for macOS, San Diego, California. 


This was further confirmed through microscopy on breath-derived exosomes (Fig. 19). 


Again, vesicles from fractions 7 and 8 were consistent across replicates and of morphology 
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consistent with exosomes (Fig. 16). This confirmed that these fractions are suitable for use in 


the development of exosome-based diagnostics. 


Figure 19. Characterization of SEC-based isolated breath-derived exosomes by TEM and cryo-SEM. Images are 
representative of breath-derived exosomes isolated from EBC showing fractions (A) 7, (B) 8, and (C) 9 and cryo-SEM images 
of fractions (D) 7, (E) 8, and (F) 9. Scale bar 100 nm. 


The scanning transmission electron microscopy-energy dispersive spectroscopy (STEM- 
EDS) chemical mapping technique was used to distinguish exosomes from contaminants that 
might be present in the sample. This powerful analytical technique enabled analyzing the 


elemental identity of exosomes and was performed on exosomes isolated from EBC (Fig. 20). 
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Figure 20. STEM-EDS тар and full EDS spectrum of a single breath-derived exosome. Images are representative of a 
SEC-based isolated (fraction 8) single breath-derived exosome. EDS elemental analysis spectrum shows well-defined peaks 
of carbon, oxygen, and uranium labeled with an arrow. The elemental mapping includes (A) bright field image and elements 
identified in the vesicle: (B) carbon, (C) oxygen, (D) sodium, (E) silica, and (F) uranium. The Y-axis represents the number of 
counts per channel and the X-axis represents X-ray energy (keV). Scale bar 20 nm. 


The sample was placed on a copper grid and stained with uranyl acetate, explaining the 
presence of these otherwise unexpected elements in the EDS spectrum. The presence of 
abundant carbon and oxygen that maps as co-located with the exosome is consistent with it 
being of organic origin. This validation is important, as nanosphere impurities within the same 
size range as exosomes under certain circumstances were observed. These might arise from 
etching of lab glassware during buffer storage or autoclaving, as water etching of glass has 
been reported*®. Silica nanoparticles of this size range are known^66 and this would be 


consistent with their STEM-EDS map and EDS spectrum (Fig. 21). 


Figure 21. STEM-EDS map and full EDS spectrum of a silica (silicon dioxide) ball. Images are representative of a silicon 
dioxide ball that mimics morphology of exosomes. EDS elemental analysis spectrum shows a well-defined silica peak labeled 
with an arrow. The elemental mapping includes (A) a bright field image, (B) carbon, (C) oxygen, (D) sodium, (E) silica, and 
(F) uranium. The Y-axis represents the number of counts per channel and the X-axis represents X-ray energy (keV). Scale bar 
100 nm. 


To confirm further that the isolated vesicles were exosomes, immunoblotting was 
performed using a cocktail of primary antibodies against CD63 and CD81, which are known 
general tetraspanin EV markers**’ (Fig. 22 A and B) that are embedded in the surface of 
exosomes. The immunoblotting was visualized using TEM through gold nanoparticles (AuNP) 
(Fig. 23), conjugated to a secondary antibody (m-IgGx BP-HRP) that interacts with both 


primary antibodies. A negative control, when the sample was blotted only with the secondary 
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antibody, was performed to show that there was no non-specific binding (Fig. 22 C). This 


confirms that individual putative exosomes contain a relevant marker. 


Figure 22. Grid-immunoblotting-based exosome detection. TEM images of exosomes from (A) WI-38 cells and (B) EBC 
treated first with a cocktail of anti-CD63 and anti-CD81 then with a secondary antibody conjugated to AuNP. (С) EBC 
exosomes were also treated only with the secondary antibody, indicating that association of the AuNP with the exosome is 
dependent on the presence of the primary antibody. White arrows on images point to selected AuNPs. 
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Figure 23. STEM-EDS тар апа full EDS spectrum of AuNP used іп grid-immunoblotting. The images are representative 
of AuNP conjugated to the secondary antibodies used for immunoblotting. EDS elemental analysis spectrum shows a well- 
defined gold peak labeled with an arrow. The elemental mapping includes (A) a bright field image, (B) nitrogen, (C) gold, 
and (D) uranium including overlay of all elements. The Y-axis represents the number of counts per channel and the X-axis 
represents X-ray energy (keV). Scale bar 20 nm. 


In contrast, conventional western blotting approaches only report the average presence of 
markers within the sample for non-tumorigenic exosomes including cell- and breath-derived 
exosomes (Fig. 24 A) as well as their presence in the whole proteome of both cell lines (Fig. 
24 В). 
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Figure 24. Validation of cell- and breath-derived exosomes by western blot analysis. Western blot analysis using general 
tetraspanin EV markers CD63 and CD81 on cell (WI-38)- and breath-derived exosomes. (А) Samples that contain exosomes 
isolated by two different methods SEC (combined fractions 7-9) and UC are labeled SEC+ and UC-, respectively. Exosome- 
free negative samples (matching total protein content to exosome samples) are also shown and isolated by the same methods 
are labeled SEC- and UC-, respectively. The selected markers demonstrate that the methodology of breath exosome isolation 
is equivalent to the established isolation methods from cell culture medium. (B) Validation of the same markers is also shown 
on cell-derived exosome isolated by UC from A549 and WI-38 cells and their whole proteome. 


2.6 DISCUSSION 


Itis well known that cell culture media can contain several types of membrane vesicles, 
including apoptotic blebs and microvesicles in addition to exosomes 223! 3839, Because of this, 
the purification steps used to isolate exosomes are critical to yield and purity. Although, UC is 
a simple and low-cost method of extraction that is amenable to large sample sizes, vesicles 
isolated through UC were found to be fragmented and impure (Fig. 13 A). This is consistent 


with previous observations 6546? 


and can be explained as the consequence of shearing force 
that can potentially change the original morphology of the vesicle deforming 1225470, 
Previous studies have also found that vesicles isolated by UC do not demonstrate the 
purity associated with other forms of isolation337352. which makes results from use of the 
samples uncertain. Although immunoaffinity capture gives high quality exosomes???, we chose 
to use SEC as it is not biased towards exosomes bearing the particular markers chosen for 
immunoaffinity capture, but generates similarly pure exosomes337352_ A feature of SEC is the 
ability to optimize towards yield or purity. As our envisaged use for breath-derived exosomes 
is for development of diagnostics, we optimized for purity, rejecting fractions from the size 
exclusion column that are likely to contain contaminating vesicles or protein aggregates. 


However, in the later stages of this study, the UC method was re-considered and applied with 
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several modifications leading to isolation of exosomes with much higher purity than originally 
isolated via UC. The rationale behind this decision is explained in Chapter 3 (3.3.1). 

Whilst our isolation protocol delivered samples with a relatively homogeneous range of 
sizes that are consistent with exosomes, further phenotyping^" ^7 was required to confirm that 
these breath-derived vesicles are genuine exosomes. This phenotyping provides 
characterization of breath-derived exosomes that is missing from current literature. Exosomes 
derived from breath are consistent with those from other sources. 

Our experience is that breath exosome samples stored for more than a day begin to lose 
their well-defined morphology and artifacts begin to appear in samples analyzed through 
microscopy (Fig. 21). Whilst we have used both TEM and cryo-SEM in this phenotyping work, 
we found that cryo-SEM provides better definition of morphology of the vesicles, as noted by 
Wu et al?!. The composition of the isolated vesicles is fully consistent with exosomes, being 
comprised of organic matter (STEM-EDS) and having a protein content consistent with 
conventional cell-derived exosomes. 

The use of STEM-EDS is a valuable method of distinguishing exosomes from silica 
nanospheres, which can be readily mistaken for exosomes based on microscopy alone as they 
are morphologically similar. Early in our method optimization, silica contamination was found 
when running STEM-EDS elemental analysis. It seemed likely that these contaminants came 
from the glass used in reagent storage. Analysis of PBS stored in glass confirmed that it is a 
potential source of silica contamination (Fig. 21). Our methodology delivered breath-derived 
exosomes that were free of silica nanosphere impurity through use of a 1xPBS buffer that was 
prepared on the day it was to be used and immediately transferred to high grade plasticware 
following filtration (0.22 um MF-Millipore MCE membrane (Millex-GS, Merck Millipore 
Ltd., UK)). 

The average of size of the particles was roughly similar across the major exosome- 
containing fractions as would be expected following size exclusion chromatography. The 
vesicles are in the range of 45-100 nm as seen by DLS, cryo-SEM and TEM. STEM-EDS 
demonstrates that they are of organic origin, and the isolated EVs are confirmed to contain 
protein. Evidence from microscopy, DLS, and protein composition points to the need to 
undertake careful method development for demanding applications, such as diagnostics. Based 
on the results of this study, only a narrow range of size exclusion fractions are found to contain 
clean exosomes. Fractions eluting after these are contaminated with smaller structures (Fig. 
17) and high protein content. In general, protein concentration is indicative of exosome 


concentration?*. However, this will not be true where vesicles are contaminated with protein 
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aggregates. Normalising the protein concentration to DLS intensity, acting as a useful proxy of 
exosome concentration, powerfully illustrates this with the last collected sample fraction 
containing up to approximately 10 times more protein per unit of isolated exosomes than the 
pure exosome fractions. Given their potential use in protein-based diagnostics, it is particularly 
important to avoid such protein contamination in purified exosome fractions. 

The immunoblotting-grid technique was used to validate the presence of the general 
tetraspanin EV markers CD63 and CD81 which are embedded in the surface of exosomes. 
These were first described by Brlansky and Derrick*” in detection of plant viruses, but have 
since been used extensively as exosome markers*!>4!6, Whilst technically demanding, this 
technique provided a high standard of proof that the exosome-specific markers are located on 


the purified exosomes. 


2.7 CONCLUSION 


The aim of this chapter was to describe the isolation of cell- and breath-derived exosomes 
and characterize them using physicochemical methods. We aimed firstly to optimize the 
methodology to isolate exosomes that are sufficiently high quality from cell-based sources and, 
secondly, to apply the optimized methodology to isolate exosomes from human breath. 
Following this, phenotyping of breath exosomes was carried out by comparing them to cell- 
derived exosomes. This study provides detailed characterization and comparison of two 
methods of isolation: ultracentrifugation and size-exclusion chromatography. 

Through careful characterization, this study confirms that exosomes can be isolated 
in a highly purified form from breath. Using size exclusion chromatography, these exosomes 
can be isolated in sufficient purity to be used for diagnostic purposes, such as proposed for 
early-stage detection of lung cancer. These demanding applications require highly purified 
exosomes. Isolation based on size exclusion chromatography should be linked to different tests 
to determine which fractions contain exosomes of sufficient purity. We found that a subset of 
chromatographic fractions is reproducibly of high standard, passing metrics on morphology, 
size, exosome-located markers, and protein contamination. The application of 
ultracentrifugation gave samples that were pure as those obtained from size-exclusion 
chromatography, whilst having the advantage that isolation from large sample is possible with 


several modifications. 
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Phenotyped exosomes from EBC are equivalent to exosomes isolated from more 
conventional sources, such as cell culture media. The findings from this chapter will 
allow us to characterize breath-derived exosomes with a goal to determine whether exosomes 
obtained from WI-38 cells provide a valid proxy for breath-derived exosomes using 
bioinformatics techniques based on the identified proteomes (ie., compare WI-38 cell 
proteome to human breath proteome). In addition, proteomic analysis of exosomes from cell 
models of normal lung tissue (WI-38 cells) and lung cancer (A549 cells) will be used to propose 
biomarkers. Western blotting technique will be used to test whether proposed biomarkers of 
lung cancer from the cell studies are absent from the exosomes of breath of healthy human 


volunteers. 
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CHAPTER 3. PROTEOMIC PROFILING CELL- AND ВКЕАТН- 
DERIVED EXOSOMES AND PROPOSED BIOMARKERS 


Aim: 

The aims of this chapter were to 1) define the proteome of exosomes from cell models 
of normal lung tissue (WI-38 cells) and lung cancer (A549 cells) to demonstrate that it is 
possible to identify potential biomarkers of lung cancer and evaluate whether those may be 
revealed on exosomes to develop a proof-of-principle test using cell culture model and 2) 
determine the benchmark proteome from breath-derived exosomes and use those exosomes as 


a negative control to test the proposed putative biomarkers of lung disease. 


Objectives: 

1. Identify the proteins found in cell- and breath-derived exosomes using proteomic 
methodology. 

2. Use proteomic analysis of exosomes from cell models of normal lung tissue (WI-38 
cells) and lung cancer (A549 cells) to propose biomarkers of lung cancer. 

3. Determine whether exosomes obtained from WI-38 cells provide a valid proxy for 
breath-derived exosomes using bioinformatics techniques based on the identified 
proteomes. Use western blotting to test whether proposed biomarkers of lung cancer 
from the cell studies are absent from the exosomes derived from the breath of healthy 


human volunteers. 


3.1 INTRODUCTION 


3.1.1 Overview of proteomics 


One of the first steps towards modern-day proteomics was the introduction of two- 
dimensional gel (2D-gel) by O’Farrell іп 197544 who developed the protein separation 
technique while studying Escherichia coli. This technique was able to resolve 1100 different 
components with a maximum of 5000 proteins. Although electrophoresis allowed for 
separation of proteins, there was difficulty in separating hydrophobic and extremely acidic or 
basic proteins” and had low reproducibility*’®. At that time, Edman degradation was the main 
technique for protein identification through sequencing and was of low sensitivity*””. The 
major improvement in proteomics was associated with creating the first database of proteins 


based on 2D gels analysis. The prototype of this database was the result of 3 years’ work of 
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electroblotting and isoelectric focusing done by Aebersold et alW?š 480 who realized the great 
potential of internal sequencing analysis. 

The word “proteome” was created by Mark Wilkins while doing his PhD studies at 
Macquarie University in Australia and represent two worlds “PROTEin” and “genOME” 
combined together. The term “proteomic” appeared in 1995 and was defined as a large-scale 
characterization of the entire set of proteins that plays a critical role of all living organisms, 
tissues and cells**'. Mass spectrometry was used in proteomic studies as it could provide 
molecular fingerprinting to identify proteins. 

With its further development, proteomics split in several branches relating to slightly 
different experimental goals. For example, protein expression (comparing proteomes between 
samples), structural proteomics (determination of atomic resolution three-dimensional 
protein structures on a genome-wide scale in order to better understand the relationship 
between protein sequence, structure, and function)**?, functional proteomics (characterization 
of enzyme activities as well as protein-protein interactions), proteome mining (drug discovery 
target), and finally post-translational modification analysis including phosphorylation, 
glycosylation, methylation, acetylation, amidation, and many other types 55-454, 

Since the introduction of 2D gels, proteomics has become more established, and new 
technologies related to sample preparation and analysis have emerged. Nowadays, the three 
pillars of human proteomics include protein fractionation techniques (complex protein or 
peptide mixtures), mass spectrometry technology to acquire the data for individual protein 
identification, and bioinformatic analysis (those steps will be described later in this chapter)**>. 
Before proceeding with sample preparation, the methodological approach of the experiment 
has to be chosen. There are two main methodological approaches: bottom-up and top-down 
proteomics. The choice of the most suitable approach depends on the type of study being 


undertaken, such as proposing biomarkers or developing a protein-based drug. 


3.1.2 Proteomic approaches and techniques in identification of proteins and reliable 


biomarkers 


There are two common approaches for protein characterization: bottom-up and top-down 
approaches. Each of them has their own advantages and disadvantages. The bottom-up 
approach with label-free quantification for identification of proteins is based on spectral 
counting of identified matched peptides after MS/MS analysis. This was a primary method 


used in this study and is explained in detail below. Two other strategies, labeling and chemical 
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tagging, аге also considered bottom-up approaches. For comparative purposes, (һе top-down 
approach is also briefly introduced. 

Bottom-up proteomics is often used in large-scale studies of proteins. This method is 
based on digestion of proteins (in-solution or in-gel) before they can be identified through 
liquid-chromatography-tandem mass spectroscopy (LC-MS/MS). Once the sample is in the 
instrument, peptides are separated by LC, ionized, volatilized, and passed into the first mass 
spectrometer (MS-1), which then separates those ions based on their mass-to-charge ratio 
(m/z). Peptide ions are fragmented into daughter ions in a collision chamber and passed through 
to the second spectrometer (MS-2), which separates and detects the fragment ions, also based 
on their m/z values. Substantial informatics analysis allows determination of the sequence of 
peptides detected in MS-1 based on the MS-2 daughter fragment ions. These sequences are 
used to predict the proteins present in the original sample*95. Various schemes of quantification 
are available, which are based on the signal intensities at either MS-1 or MS-2. MS-2-based 
quantification is most common if the original sample is a complex mixture. 

Bottom-up proteomics is a widely used method of protein identification with an absolute 
quantitation of protein expression. There are several strategies that have been developed to 
assist with better quantification. The most employed are metabolic labeling, chemical tagging, 
and label-free quantification. 

The metabolic labeling approach uses stable isotopes to label amino acids in cell culture 
(SILAC) during cell growth and division. Since first introduced in 2002, this method uses 
signal quantification from MS-1 spectra and was described by several groups487-459 The 
method was developed to allow accurate quantification of the differences between different 
cell populations, such as a treatment and control in an experiment. One cell population is 
cultured in standard culture media, while another cell population is incubated with media 
containing isotopically labeled amino acids, typically including arginine or lysine containing 
six ?C labels. The treatment population can be combined with the control, yet still be 
distinguished by mass spectrometry as one is “heavy” (containing the isotopically labeled 
amino acids) and the other is “light” (containing non-labeled amino acids). Mixing populations 
allows greater sensitivity in the experiment and higher reproducibility as treatment and control 
are quantified simultaneously in the same LC-MS/MS run. Simultaneous quantification is 
achieved in the mass spectrometry spectra as each peptide will appear as a doublet that has 
distinct mass differences, the heavy or light version. Based on the relative mass spectrometry 


signal intensities from the mixed population, the differential protein abundances between two 


73 


samples can be calculated by comparing the intensity differences of the pair of isotope labeling 
peaks in mass spectrometry ^o, 

Two common methods of quantification use chemical tagging: isobaric tags for relative 
and absolute quantification (ІТЕАО) and tandem mass tags (TMT)?!. Both approaches are 
based on defining intensity of reporters ions of different masses that are released from 
isobarically labeled peptides 92493. Because of the isobaric nature of these tags, they have 
identical physicochemical properties, more specifically identical masses. Tagged samples, for 
example from treatment and control cell populations, can be mixed, and peptides from each 
population appear in MS-1 as a single composite peak reporting the same mass value and 
resulting from the same retention time. Following fragmentation of the labeled peptides, MS- 
2 based quantification generates reports of ion peaks from different tag fragment masses and 
so allows quantification of the different cell populations under study. Both ІТКАО and TMT 
tags have the same general components: reporter and balance groups and label amino termini 
of peptides and the side chains of lysine. However, the structure of the isobaric tags differs 
between the approaches. iTRAQ uses 4-plex and 8-plex isobaric tags^^, while TMT uses 2, 4, 
6, 10, 16-рІех“% and even 27-plex^9. Based on their molecular differences, iTRAQ and TMT 
use different isobaric tags to simultaneously identify and quantify proteins. 

Whilst labeling methods, such as SILAC, iTRAQ and TMT can be multiplexed, label- 
free quantification is usually chosen if several separate samples need to be compared as it 
allows comparison of a sample with any other samples, whereas in labeled experiments, 
samples directly compared with those samples that were physically mixed and measured 
together with a control in one run. There are currently two fundamentally different approaches: 
quantification of ion intensity (changes in either peptide peak areas or peak heights) and 
quantification of spectra (spectral counting of identified matched peptides after MS/MS 
analysis). The second of these was used in this study. 

The principle of chemical tagging for both techniques is generally the same. The iTRAQ 
and TMT markers are used for labeling proteins inside living cells. Both techniques can label 
proteins from multiple sets of data (treatments and controls). For example, changes in protein 
abundances сап be compared between cancer and non-tumorigenic samples *?". Although the 
principle of labeling peptides is the same, their molecular structure and labeling specification 
are different. 

In one approach to quantification of ion intensity, the area for the m/z peak of each 
peptide is determined, as this has a linear relationship to peptide concentration. This approach 


has some challenges with reproducibility and can result in different peak intensities depending 
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on the machine settings and sample preparation. Unlike quantification of ion intensity, spectral 
counting, which was used in this study, compares numbers of identified MS/MS spectra from 
the same protein present in the same sample in each of multiple LC-MS/MS datasets^?. While 
this approach does not require complex data analysis compared to the quantification of ion 
intensity approach, normalization is required. This approach had been usefully demonstrated 
in studies that directly relate to this project: biomarker identification?* and the ability to 
distinguish between cancerous and non-tumorigenic cells*”’. 

Top-down proteomics is based on analysis of intact proteins without use of enzymatic 
digestion. Unlike the bottom-up approach, top-down proteomics retains more information 


about the parent proteins in the sample, and so is most likely to identify PTMs that cannot be 


detected through the bottom-up approach. 


3.1.3 Proteomics studies of exosomes 


357,500,501 


Although isolation and purification of exosomes is challenging , several studies 


have presented comparative proteomic analyzes of exosomes and circulating EVs from 
different biological fluids 74502503, 

Recently, numerous studies have reported that the composition and content of exosomes 
can be used for cancer diagnosis (Tables 1 and 4). Proteomic studies demonstrate that 
exosomes carry a complement of proteins that indicates the nature of their cell of origin. 

This will include proteins that сап be used as biomarkers of disease, facilitating use of 
exosomes in a cancer diagnostic device. Using exosomes for diagnostics based on proteomic 
analysis requires high purity exosomes, free of contamination from other EVs or protein 
aggregates. This requires careful attention to the isolation methodology (as demonstrated in 
Chapter 2), which may need to be matched to the source of exosomes, as obtaining high quality 
exosomes from some biological materials can be especially challenging. 

In the past 20 years, mass spectrometry studies have been investigating the composition 
of exosomes in order to facilitate biomarker discovery. The specific protein cargo of exosomes 
arises from the combination of the parental cell type and its environmental conditions as 
previously described in 1.1.3. Both bottom-up and top-down proteomic approaches have been 
applied to the study of the protein profile of exosomes. Early proteomic analysis of vesicles 
mainly concentrated on characterization of highly abundant proteins that were separated using 
2D gels and then characterized by matrix-assisted laser desorption ionization-time of flight 


mass spectroscopy (MALDI-TOF-MS), through which important exosome markers 
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(tetraspanins, annexin, and heat-shock proteins) were identified. At that stage only a small 
number of proteins could be identified. Later, with an improvement of EV isolation 


techniques? 


, protein separation, and mass spectroscopy, thousands of proteins from various 
types of biological samples have been identified and the integrated web-accessible database 
(Vesiclepedia) has been created^95, 

A comprehensive study of the exosome proteome was undertaken by Choi et al?6, This 
study acknowledged that mass spectrometry analysis can identify "thousands of vesicular 
proteins" and so proceed with their cataloging and other interrelationships. This study explored 
protein-protein interaction of mammalian vesicles underlining the topology of exosome 
proteins. By using integrated proteomic data, they were able to generate a platform that 
identifies protein-candidates related to EVs. Extensive studies over the past 10 years have 
attempted to characterize the exosome proteome and especially the cancer exosomal proteome. 
This interest in the proteome of various sources of exosomes arises from a desire to develop 
clinical diagnostic tools for early detection or monitoring of disease. For example, comparative 


proteomic evaluation has been reported on exosomes isolated from different biological fluids, 


507 508 509 


such as urinary ехоѕотеѕ?°”, circulating serum ехоѕотеѕ?, and seminal plasma exosomes 
as well as proteome of specific cancer related exosomes such as bladder cancer exosome?!? and 


ovarian cancer exosomes’® ` 


3.2 METHODS 


3.2.1 Protein precipitation for cell- and breath-derived exosomes 


Exosomes were purified as described in 2.4.1. Collected exosomes were lyzed with 8 M 
urea solution following flash-frozen (-80°C)/thawed cycle (hot running tap water) repeated 3 
times. The 1узаїе was vortexed for about 10 s between each cycle. Then samples were 
centrifuged at 13,000х for 5 шіп and protein rich supernatant was collected and transferred 
into clean Eppendorf LoBind tubes. Then proteins were precipitated by addition of 400 uL of 
acetone per 100 uL of supernatant followed by chilling at -20 °C. After 1 hour, vials were 
vortexed and centrifuged at 13,000xg for 10 min. Acetone was carefully pipetted and residual 
acetone was removed by air-drying, providing a dry protein pellet for analysis. The pellet of 
proteins was collected from the bottom of the Eppendorf tube, resuspended in 10 uL of 8 M 


urea, and kept for further analysis. 
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3.2.2 Whole proteome precipitation 


Cells (A549 and WI-38, 2.1х107сеП5/еасһ cell line) were detached with trypsin, 
transferred to a 50 mL centrifuge tube (Falcon®), and centrifuged at 300xg for 5 minutes. 
Medium was decanted and pellets were washed twice with 5 ml of ice-cold 1xPBS. Pellets 
then were recentrifuged at 300xg for 5 min. PBS was decanted and pellets were subjected to 
lysis with 8 M urea following flash-frozen (-80°C)/thawed cycle (hot running tap water) 
repeated 3 times. The lysate was vortexed for about 10 s between each cycle while the sample 
was in liquid phase. Samples were centrifuged at 13,000xg for 5 min, and protein-rich 
supernatant was collected and transferred into a clean Eppendorf tube. Proteins were 
precipitated by addition of 400 uL acetone per 100 uL of supernatant, followed by cooling to 
-20 °C. After 1 hour, vials were vortexed and centrifuged at 13,000xg for 10 min. Acetone 
was carefully aspirated, and residual acetone was removed by air-drying. The pellet of proteins 
was collected from the bottom of the Eppendorf tube, resuspended in 10 uL of 8 M urea, and 
kept for further analysis. 


3.2.3 Protein quantification assay 


A Pierce BCA Rapid Gold protein assay kit (Thermo Fisher) was used to quantify the 
amount of protein in the sample following manufacturer's instructions. First, two-fold serial 
dilutions of an albumin standard (BSA) were performed, covering protein concentration in a 
working range of 20-2000 ug/mL. A 20 uL aliquot of each of the standard dilutions was added 
to a 96-well microtiter plate in triplicate. Then, 10 uL of a cell- or exosome-derived protein 
sample was added to each well (in triplicate) in the same 96-well plate and brought up to 20 
uL final volume (2-fold dilution). The working reagent was made by following the ratio 50:1 
(reagent А: B, v/v). Then 200 uL of the working reagent was added. The plate was wrapped 
in aluminum foil and left on the bench for 10 min. The absorbance of the samples was 
determined at excitation wavelength (Acx) of 480 nm and emission measured at a wavelength 
(Aem) of 525 nm using an EnSpire 2300 multilabel plate reader (Perkin Elmer). The blank 
absorbance (buffer + working reagent) value was measured and subtracted from the absorbance 
value of the standards and all samples. The protein concentration of each sample was 
determined through application of the BSA standard curve, which was constructed using Prism 
GraphPad software (version 9.1.1 (223)) for macOS, April 16, 2021), San Diego, California. 
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3.2.4 Protein digestion (in-solution) 


Protein pellets from acetone precipitation were treated with dithiothreitol (DTT) to a final 
concentration of 10 mM. Samples were wrapped in foil, vortexed and heated at 56 °C for 15 
min followed by another 15 min incubation step with agitation (shaking) at КТ. Samples were 
alkylated with 10 mM iodoacetamide (IAA) at ЕТ in the dark for 30 min and diluted with high- 
grade LC-MS water so that the residual urea concentration was no more than 2 M. The protein 
concentration was determined as described in section 3.2.3, and sufficient quantity of trypsin 
was added so that it was present in a 1:50 ratio (trypsin: protein, w/w). The solution was 
carefully mixed and incubated overnight at 37 °C. The resultant peptides were purified using 
100 uL ОМІХ C18 tips by using different stock solutions: washing solution (0.1% 
trifluoroacetic acid (TFA)), conditioning solution (5096 acetonitrile (ACN)/water, v/v), and 
elution solution (50% ACN/water, у/у/0.1% formic acid (FA)) as follows: 100 uL of 
conditioning solution was aspirated and solvent was discarded. This step was repeated 10 
times. From this point, no air was drawn into the tip. The same cycle was repeated but with 
100 uL of washing solution finishing the equilibration step and moving on to the sample 
binding step. The sample was loaded into tip and then dispensed. This aspirate-dispense cycle 
was repeated 5 times for maximum efficiency. After aspirating 100 uL of washing solution, 
the solvent was discarded. The washing step was repeated 4 times. For the final, elution step, 
100 uL of elution solution was aspirated and the peptides in the total volume of 100 uL were 
eluted into a new vial. Eluted peptides were transferred into a SpeedVac vacuum concentrator 
(Labconco CentriVap Concentrator, 78100-00) attached to the CentriVap Ultra-low cold Trap 
(Labconco, model 7811021; with specification -50 °С and 99.3% sample recovery). After 
drying samples for 40 min, completely dried peptides were resuspended іп 20 uL of 0.1% FA 
and then transferred to a glass sample vial for LC-MS/MS analysis. 


3.2.5 Protein digestion (gel-tube method) 


Protein pellets precipitated from exosomes as described in 3.2.1 were resuspended in 
20 uL of 8 M urea solution. The purification began with making gel plugs. To a 20 uL sample 
of resuspended protein, 2% SDS (20 uL of 10% SDS stock), water (30-35 uL), 7.5% 
acrylamide/bis-acrylamide (25 uL), 0.25% tetramethylethylenediamine (TEMED) (0.25 uL) 
were added making a final volume of 100 uL. The sample was vortexed and centrifuged until 


all bubbles disappeared. To that, 0.25% ammonium persulfate (APS) (5 Ш) was added to 
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initiate polymerization. Тһе tube was briefly vortexed and rapidly centrifuged. After 
polymerization, the formed gel plug was fixed іп 2% phosphoric acid, 30% ethanol for 1 hour. 
Briefly, the fixative solution was added on top of gel plug, the gel plug was lifted with clean 
plastic tip, and tube was placed upside-down on a rocking table. After an hour, the fixing 
solution was replaced with fresh 2% phosphoric acid and 30% ethanol solution and fixed for 
another hour. Then solution from the tube was aspirated and gel plugs were left in 30% ethanol 
for 1 hour to shrink (at this point, gel plugs can be stored at -20 °C if the experiment needs to 
be split into several days). After, the 30% ethanol solution was discarded, and gel plugs were 
cut into 2 mm sections and each section into approximately 2 mm? pieces. Gel pieces were 
placed into a clean 2 mL Eppendorf tube, washed four times with 400 uL 75% ACN, 25% 
NH4HCOs at 25 mM, and then dehydrated with 400 uL ACN. The ACN was removed. 
Reduction of disulfide bonds was initiated by adding DTT to a final concentration of 10 mM 
to cover all gel pieces. Samples were wrapped in foil, vortexed, and heated at 60°C for 30 min 
followed by alkylation with 55 mM IAA at RT in the dark for 20 min. After the alkylation, the 
solution was aspirated, and gel pieces were washed 3 times in 200 uL 25 mM NH4HCO; and 
200 uL ACN and dehydrated twice with 200 uL ACN. Trypsin was added at a 1:100 (trypsin: 
protein, w/w) ratio in a volume sufficient to cover all gel pieces and incubated overnight at 37 
°C. To extract peptides, 200 uL 60% ACN in 0.1% FA solution was added for 1 hour under 
agitation. The solution containing digested peptides was aspirated and transferred to a clean 
vial. To further extract peptides, 160 uL 100% ACN was added for 30 min under agitation. 
The solution after the second extraction was aspirated and pooled together with the first 
collected extract, bringing the final volume to 360 uL. Gel pieces then were discarded. The 


peptide solutions were dried in a CentriVap and resuspended in 0.196 FA as described in 3.2.4. 


3.2.6 Peptide quantification 


The peptide mixture (resuspended in 10 uL of 0.1% FA) obtained following method 
(peptide digestion) was quantified using a Pierce Quantitative Fluorometric Peptide assay kit 
(Thermo Fisher) following manufacturer's instructions. The fluorescently labeled peptides 
were detected using a fluorescence plate reader (EnSpire 2300 multilabel plate reader, Perkin 
Elmer) with excitation wavelength (ex) of 390 nm and emission measured at a wavelength 
(Aem) of 475 nm. For the measurements, 10 uL of the peptide digest assay standard (1 mg/mL), 


1.5 mL, tryptic digest of a protein standard, іп 50 mM ammonium bicarbonate and 0.05% 


79 


sodium azide provided with the kit and processed purified protein samples were transferred in 
a fluorescent-compatible black 96-well microplate (ThermoFisher). Peptide digest assay 
standard was prepared in 1:2 fold dilutions. Then, 70 uL of Fluorometric Peptide Assay Buffer 
was added to each well followed by 20 uL of Fluorometric Peptide Assay Reagent added to 
each well. After 5 min of incubation time measurements were taken. The standard curve was 


used to determine the peptide concentration of each unknown sample. 


3.2.7 Matrix-assisted laser desorption ionization-time of flight mass spectroscopy 
(MALDI-TOF-MS) 


For mass spectrometry analysis, 1 uL of protein-containing eluate was purified using 100 
uL OMIX C18 tips and mixed in a 1:1 ratio (v/v) with a-cyano-4-hydroxycinnamic acid matrix. 
The matrix was prepared fresh on the day of the experiment. To 10 mg of the previously 
mentioned matrix (ThermoFisher) 50:50 water/ACN (v/v) with 0.196 TFA final concentration 
was added. The mixture was first vigorously vortexed for 1 min to dissolve (some undissolved 
matrix will remain), then centrifuged for 20 sec at 2,000хр, and transferred to a new tube. In 
duplicate, 1 uL of the final solution was deposited onto reusable target АВ SCIEX Opti-Tof™ 
Cal Mix 5 plate with 384 sample positions and allowed to air dry. The AB SCIEX TOF/TOFTM 
5800 matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometer 
(MALDI-TOF-MS) was calibrated using 700-4000 m/z calibration mix. Tandem mass 
spectrometry was performed on the 20 most intense peaks in each sample, generated by 
applying 8,000 laser power per spot. To assign identities to the sample, the ProteinPilot™ 
(SCIEX) software (v.5.0)?!! was used. 


3.2.8 LC-MS/MS analysis (cell-derived exosomes and whole proteome) 


The peptide mixture (resuspended in 20 uL of 0.1% FA) obtained following method 
3.2.4 was quantified using a Pierce Quantitative Fluorometric Peptide assay kit (Thermo 
Fisher) following manufacturer's instructions. The fluorescently labeled peptides were 
detected using a fluorescence plate reader (EnSpire 2300 multilabel plate reader, Perkin Elmer) 
with excitation wavelength (Acx) of 390 nm and emission measured at a wavelength (em) of 
475 nm. For the measurements, 10 uL of the peptide digest assay standard (1 mg/mL), 1.5 mL, 
tryptic digest of a protein standard, іп 50mM ammonium bicarbonate and 0.05% sodium azide 


provided with the kit and processed purified protein samples were transferred in a fluorescent- 
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compatible black 96-well microplate (Thermo Fisher). Peptide digest assay standard was 
prepared in 2-fold dilutions. Then, 70 uL of Fluorometric Peptide Assay Buffer was added to 
each well followed by 20 uL of Fluorometric Peptide Assay Reagent added to each well. After 
5 min of incubation time measurements were taken. The standard curve was used to determine 
the peptide concentration of each unknown sample. After quantifying the peptide abundance, 
each sample was checked using MALDI-TOF/TOF by using a-cyano-4-hydroxycinnamic acid 
matrix for the presence of contaminants. Then 10 uL of peptide samples described in 3.2.4 - 
3.2.5 were placed in mass spectrometry vials with inserts and subjected to LC-MS/MS on 
a QExactive plus Orbitrap mass spectrometer (Thermo Fisher) with a nanoESI interface in 
conjunction with an Ultimate 3000 RSLC nanoHPLC (Dionex Ultimate 3000). The LC system 
was equipped with an Acclaim Pepmap nano-trap column (Dinoex-C18, 100 À, 75 um x 2 cm) 
and an Acclaim Pepmap RSLC analytical column (Dinoex-C18, 100 А, 75 pm x 50 cm). The 
tryptic peptides were injected to the enrichment column at a flow of 5 uL/min of 2% ACN 
(v/v) containing 0.196 FA (v/v) for 5 min before the enrichment column was switched in-line 
with the analytical column. The eluents were 5% DMSO in 0.1% formic acid (v/v) (solvent A) 
and 5% DMSO in 100% ACN (v/v) and 0.196 FA (v/v) (solvent B). The flow gradient was (1) 
0-6 min at 3% B, (ii) 6-95 min, 3-22% B (iii) 95-105 min 22-40% B (iv) 105-110 min, 40-80% 
В (v) 110-115 min, 80-80% B (vi) 115-117 min, 80-3% and equilibrated at 3% B for 10 minutes 
before the next sample injection. The QExactive plus mass spectrometer was operated in the 
data-dependent mode, whereby full MS1 spectra were acquired in positive ion mode, 70 000 
resolution, automatic gain control target of 3e° and maximum injection time of 50 ms. Fifteen 
of the most intense peptide ions with charge states 22 and intensity threshold of 1.7e^ were 
isolated for MS/MS. The isolation window was set at 1.2 m/z and precursors fragmented using 
normalized collision energy of 30, 17 500 resolution, autonomic gain control target of 1e? and 
maximum IT time of 100 ms. Dynamic exclusion was set to be 30 s. Raw files were processed 
using Proteome Discoverer software v2.4 (Thermo Fisher) against the entire SwissProt human 


proteome database (TAXID-9606, v2019). 


3.2.9 LC-MS/MS analysis (breath-derived exosomes and whole breath) 


Samples were injected using the auto-sampler on an UltiMate 3000 liquid 
chromatography system (Thermo Fisher) connected inline to the Orbitrap Fusion™ Lumos™ 
Tribrid Mass Spectrometer (Thermo Fisher) fitted with the Nanospray Flex NG ion source. 
Xcalibur 4.2 software (Thermo Fisher) was used for method set up (Orbitrap HCD-Low Load 
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with peptide concentration less than 500 ng), control of the LC-MS/MS runs and data 
acquisition. Peptides were separated using a flow rate of 0.3 uL/min and fractionated with a 
C18 column (Dionex, LC Packings, Netherland). A 85-min buffer gradient was constructed 
from 0.1% buffer A (60% ACN, 40% high performance liquid chromatography-grade water 
(v/v), and 0.1% FA 10mM ammonium formate) and buffer B (0.196 FA in 80% ACN, v/v). 
This consisted of a linear gradient of 96% Buffer A to 4% Buffer B to 100% Buffer B over 60 
min. 100% Buffer B was maintained for 5 min, before dropping back to 4% Buffer B in 96% 
Buffer A over 5 minutes. The eluted peptides were ionized for detection by the Orbitrap 
Fusion™ Lumos™ Tribrid Mass Spectrometer (Lumos) by electron spray ionization with a 
silica tip emitter and the instrument’s conditions set to 2.5 kV voltage and transfer tube 
temperature of 275 °C. Ions with m/z range between 200 - 2000 m/z were analyzed during the 
85-min acquisition time using Fourier transform mass spectrometry (FTMS) in the Orbitrap, 
with data-dependent MS/MS on the top 6 intensity ions dynamically selected for collision- 
induced dissociation (CID) fragmentation and detection in the LTQ. The dynamic exclusion 
settings used were as follows: repeat count 2, repeat duration 30 s; exclusion list size 500; 
exclusion duration 90 s. A full scan (50 ms maximum injection time) in the FTMS at a 
resolution of 30,000 identified the 6 highest abundance ions and selected them for CID (1.5 
isolation width, normalized collision energy 35%, activation Q 0.25, activation time 30 ms) in 
the LTQ after accumulation of 500,000 ions with 10 ms maximum injection time was used as 
MS2 isolation option. Between sample runs was a wash of a gradient from 2% Buffer В to 98% 
Buffer B across 30 min. Mass spectra were analyzed with both Sequest HT and Mascot search 
methods using Proteome Discoverer software v2.4 (ThermoFisher) against the entire SwissProt 
human proteome database (TAXID-9606, v2019). Parameters were set to a maximum of 2 
missed cleavages with peptide lengths ranging from 5 to 144 amino acids selected. A precursor 
mass tolerance of 10 ppm and fragment mass tolerance of 0.8 Da with fixed cysteine 


carbamidomethylation modification. 


3.2.10 Lipid contamination removal (ethyl acetate wash) 


The wash followed the protocol that was used for period detergent removal from 
processed peptides samples?'?. First, water-saturated ethyl acetate was prepared by mixing 10 
mL of Milli-Q (Millipore) water and 80 mL of ethyl acetate іп а 125-mL glass bottle. The bottle 


was capped with its glass stopper and vigorously shaken for 1 min at RT breaking the water 
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into small droplets. The bottle was then set to rest for 5 min. Тһе shake-rest cycle was repeated 
3 times. The solution containing peptides were transferred to a glass vial. All following 
procedures were performed using a glass vial to avoid PEG contamination while using organic 
solvents. Then, 1 mL of water-saturated ethyl acetate mixture was added to the peptide 
solution, the cap was closed, and the contents vortexed vigorously on a shaker for 1 min at RT. 
The sample then was centrifuged for 15 sec at 12,000хо to separate the ethyl acetate phase 
from the aqueous phase. After organic phase separation, the upper level of ethyl acetate was 
carefully aspirated by using a glass Pasteur pipette. A small quantity of organic phase was left 
to prevent the loss from the aqueous peptide-containing phase. This cycle was repeated 3 times. 
After the final round of the cycle, the remaining small amount of organic phase was removed 
by placing the sample in a SpeedVac (Labconco CentriVap Concentrator, 78100-00) attached 
to the CentriVap Ultra-low cold Trap (Labconco, model 7811021; with specification -50 °C 
and 99.3% sample recovery) for 5 min at RT. 


3.2.11 Exosomes characterization by western blotting 


Cell- and breath-derived exosomes were assayed using western blotting to validate 
proposed biomarkers established after mass spectrometry analysis. Isolation of exosomes was 
performed as previously described in 2.4.3. Negative control samples free of exosomes were 
also assayed. Samples were suspended in lysis buffer (50 mM Tris-HCI pH 8.0, 150 mM NaCl, 
1% NP-40) and subjected to two rounds of vigorous vortexing (10 s) and cooling on ice (20- 
30 min). Samples were finally centrifuged at 15,000xg for 10 min. The pellets were discarded, 
and the protein concentration of the supernatant was determined using a micro-BCA assay 
(Pierce BCA Protein Assay Kit) according to the manufacturer’s instructions. Samples 
containing approximately 20 ug of protein were loaded оп а 10% acrylamide gel, and proteins 
separated by electrophoresis. The western blot was performed according to the protocol 
described by Lim*? with minor amendments: 5% BSA was used as a blocking agent and 
samples were heated for 15 min at 65 °C; the membrane was probed with several primary 
(monoclonal and polyclonal) antibodies as follows: primary monoclonal — Alix (1A12; Santa 
Cruz, ref. sc-53540) at 1:800; anti-CD63 (TS63; Invitrogen, ref. 10628D) at 1:1000 dilution; 
anti-CD81 (M38; Invitrogen, ref. 10630D) at 1:1000 dilution, and secondary mouse IgG kappa 
binding protein (m-IgGk BPP) conjugated to Horseradish Peroxidase (HRP) (Santa Cruz, ref. 
sc-516102) at 1:3000 dilution. Primary polyclonal unconjugated antibodies (all rabbit/IgG), 
all from Invitrogen: RhoC (ref. PA5-77866) at 1:300 dilution; MYOF (ref. РА5-53134) at 


83 


1:300 dilution; SLC2A8 (ref. AP5-42463) at 1:1000 dilution; CD151 (ref. РА5-78960) at 
1:1000 dilution, and secondary goat anti-rabbit IgG conjugated to HRP (Santa Cruz, ref. sc- 
2004) at 1:3000 dilution. All exosome (cell- and breath-derived) and whole breath samples 
were loaded on а 10% acrylamide gel as pure purified proteins at concentration 100 ng in total 
volume of 10 uL, while whole proteome samples were loaded as lysate at 20 ug in a total of 
20 uL. The membrane was visualized on a ChemiDoc Amersham Imager 600 (General 


Electric). 


3.2.12 Breath samples collection from volunteers 


Collection of breath samples was conducted through a study that has been approved by 
VUW human ethics committee (application #28700: Determining disease biomarkers in 
breath-derived exosomes for new diagnostic tools; approved September 9, 2020). The national 
Health and Disability Ethics Committee defined the project as being out of scope for their 
review. Both documents can be found in Appendix. 

Fifty-five participants (30 males and 25 females), who were non-smokers and free from 
known lung disorders or severe breathing disorders, participated in the study. Each participant 
was breathing into a tube for a maximum of 30 minutes, producing on average 1.5-2 mL of 
EBC. A total of 60 samples was collected, as five participants (four males and one female) 
donated breath twice and contributed to two samples. These 60 individual samples 
were divided into three equal groups, with samples pooled (Vio typically 34 mL) three days 
after collection. The third sample was collected from volunteers that did not participate in either 
of samples one or two. 

Given the situation related to COVID-19 at the time, we ensured that participants were 
not at-risk during sample collection. Before collecting the sample, the consent form was given 
to familiarize participants with procedure and steps. A standard operating procedure 
description was kept in the lab where principal investigator conducted the research. Each 
participant was given a sterile tube assembly (see attached picture showing the assembly), 


which was wrapped with an ice pack to keep it cool (Fig. 25). 
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Sterile 15 mL Falcon Ice pack Assembled and prechilled collection 


tube vessel. Each participant was given their 
(collection vessel) own. 


шши 


Sterile 10 mL syringe individually packed 
was given to participants on the 
collection day. This is what they will be 
breathing into. 
Fully assembled collection vessel. The breath condensate was collected from 
Plunger was set aside. walls by using a syringe plunger 


Figure 25. Collection vessel assembly for exhaled breath condensate. 


Sample collection required each participant to blow into a tube used to collect the 
condensate through a cooled collection vessel. The diameter of the tube allowed participants 
to breathe normally, although there is a small risk of developing dizziness. The list of 
participants was kept separately. The time, date, volume, and a label ‘Exhaled Breath 
Condensate’ of collected sample was written on the same collection vessel that a participant 
used during the study. Once a sample had been collected, it was stored at 4 °C and pooled with 
other samples approximately every 3 days. Combined samples were transferred into a new tube 
and re-labeled with the new date and volume only. This procedure continued until a minimum 


of 20 samples had been collected. After combining samples, the original tubes were discarded. 


3.2.13 Statistical analysis 


Analyses were performed using Prism GraphPad scientific 2D graphing and statistical 
software, version 9.1.1 (223) for macOS, April 16, 2021, San Diego, California. Regression 
analysis was used to fit a straight-line model for the Rapid Gold BCA protein assay standard 
curve. 

Area-proportional Venn diagrams were generated using BioVenn??? for the comparison 


and visualization of identified proteins in all samples used in this study. 
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STRING (v11.5)?'^ was used to visualize major protein complexes identified in the core 
of the breath exosome proteome. The network of identified proteins was constructed with 
minimum required interaction score with a high confidence threshold (STRING confidence 
score >0.7). The confidence score is the approximate probability computed by combining 
probabilities from the different evidence channels and corrected for the probability of randomly 
observing an interaction that a predicted link exists between two enzymes in the same 
metabolic map in the KEGG database. Disconnected nodes from the network were removed. 

GO enrichment analysis was performed using Shiny GO у0.61515 and g:Profiler (version 
е104 eg51 р15 3922dba)!!6. Shiny GO v0.61 is based on a large annotation database derived 
from Ensembl and STRING that allows graphical visualization of enrichment analysis and gene 
characteristics along with an ability to retrieve information of protein network and KEGG 
pathways. g:Profiler was used to perform gene ontology enrichment analysis based on 


statistical significance (p-value <0.05). 


3.3 RESULTS 


3.3.1 Optimization of analytical methods 


Developing standardized procedures for isolating high-quality and purity of intact breath 
exosomes was essential for identification of biomarkers. Previous research methods were 
adapted and optimized for this study. Remaining issues and limitations of the research were 
noted. 

The methodology was developed using cell-derived exosomes, providing reproducible 
access to a ready supply of exosomes for method optimisation. The most difficult part in this 
project was collecting sufficient exosome protein for proteomic characterization. It was 
important to maximize the yield of exosomes so that a sufficient amount of protein can be 
extracted for mass spectrometry analysis. Due to the low abundance of protein in exosomes in 
sample sizes used in the earlier characterisation work described in Chapter 2, it was obvious 
that it was necessary to adapt the previous isolation approach to collect a larger amount of 
exosomes. This was achieved by combining several samples together and using concentrators 
to reduce the volume of the starting material (CCM or EBC) and simultaneously to pellet 
different possible non-exosome protein aggregates from the exosomes. Initially, samples for 
proteomic analysis were isolated by SEC. However, when mass spectra were collected from 


several runs, it was not possible to analyze samples, as the amount of protein was so low that 
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no well- define peaks were present оп а chromatogram. SEC-isolated exosomes provided only 
half of the amount of protein obtained from UC-isolated exosomes. Based on results from 
Chapter 2 and the fact that optimization of UC method led to better purity, the UC isolation 
method was chosen as it most readily provided the required increase in the volume of the 
starting material (CCM and EBC). However, choosing an appropriate method for exosome 
isolation that results in higher yield was only the part of the problem. Protein loss was identified 
as the main challenge that needed to be overcome. It seemed that protein and peptide adsorption 
onto surfaces such as tubes and tips in the experiment occurred during the sample preparation 
leading to low recovery of proteins during sample preparation. Several solutions were 
evaluated that could prevent non-specific sample adsorption. For example, during the first step 
of protein extraction, several lysis buffers containing components such as 8 M urea, 6 M urea 
+2 M thiourea, and NP-40 were compared to determined highest product outcome after protein 
isolation, with the 8 M urea lysis buffer being the most effective. The concentrations of proteins 
in the samples were 0.868, 2.439, 0.535 mg/mL. During this step, it was found that the BCA 
assay is not practical for determination of protein concentration. Determination of protein 
concentration using the Lowry assay was also rejected due to lack of sensitivity for these low 
abundance samples. Finally, the Rapid Gold BCA assay was chosen as the most compatible 
with all chosen buffers, whilst offering the required sensitivity. 

At the next step, protein precipitation and solubilization of proteins, several protein 
precipitating reagents were tried, such as methanol and chloroform, or acetone. The greater 
protein recovery was achieved with the acetone precipitation method, with a total peptide 
quantification 934 ng vs 498 ng for an equivalent sample using methanol and chloroform. To 
investigate whether the organic solvent plays a difference in adsorption, simultaneous testing 
of different vials (regular Eppendorf tubes vs LoBind tubes vs glass vials) was conducted. 
While the application of detergents/organic solvents is required for the extraction of 
hydrophobic proteins and necessary to maintain proteins solubility in solution after their 
precipitation, organic solvents in the peptide sample can suppress the ionization signal and so 
must be removed prior to LC-MS/MS analysis. In this study, water-saturated ethyl-acetate 
wash applied immediately after digestion of proteins to remove lipid contamination. As shown 
later in the results section, this washing step played a positive role by increasing peptide signal 
intensity. It has also resulted in the excellent reproducibility of signal intensities for all samples. 

Although preference was given to traditional in-solution tryptic digestion, an attempt to 
try an alternative method of protein digestion was made. The decision was based on the 


hypothesis that low quantity of proteins might be caused by hydrophobic membrane proteins 
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that are difficult to re-dissolve after acetone precipitation. The gel-tube method described in 
3.2.5 is a novel approach aimed to enhance analysis of complex membrane proteins, was 
proposed as an alternative approach. This method skipped the precipitation step and suggested 
higher quantity of proteins after elution. 

To achieve reliable quantitative LC-MS/MS measurements, the injection volume for each 
sample was determined individually. The injection volume was selected based on the peptide 
concentration to prevent overloading the column, avoid broadening the peaks, and produce 
high-performance separation. To ensure that carryover was reduced during the mass 
spectrometry runs, different combinations of washing/flushing steps for the injection path were 


tested to remove residuals from previous samples and autosampler components. 


3.3.2 Evaluation of peptide recovery after traditional in-solution digestion and gel- 
tube methods on cell-derived exosomes 


To prepare a proteomic sample, the most common method, digestion in-solution was 
used. This method includes several steps of protein processing once they were harvested from 
an exosome sample: denaturation, reduction, alkylation, and digestion. 

After collection of exosome samples and validation of their presence and morphology as 
previously described in Chapter 2, proteins were precipitated and quantified. Due to the low 
abundance of proteins in exosomes, a suitable assay with the high sensitivity for protein 
quantification was required. Several protein quantification assays were tested (BCA, Rapid 
Gold BCA, and Lowry assays). Protein quantification from each sample was completed by 
highly sensitive and selective colorimetric detection of the cuprous cation (Cul+) by Rapid 
Gold bicinchoninic acid assay (Rapid Gold BCA) and Pierce Quantitative Fluorometric 
Peptide Assay (both ThermoFisher Scientific) quantification kit after the purification step to 
ensure that digestion was successful. The rapid Gold BCA assay was also optimized for protein 
quantification by conducting time-course experiments with 5-, 10-, 15-, and 30-minutes 
intervals, taking measurements after initial 5 minute. Standard curves for all data intervals were 
graphed and linear regression analysis was used. The smallest Sum of Squares (SS) equal to 
1.282 was determined for the 10-min interval, while values for 5-, 15-, and 30-min intervals 
were larger (2.000, 1.942, and 1.366 respectively). A value closer to zero indicates the smaller 
random error component suggesting that this fit will be more suitable. The coefficient of 
determination (77) values of different time intervals were also examined to establish which time 


interval result in the data showing the closest fit to the regression line. The 10-min interval had 
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the highest 7? value (0.997), while the 5-, 15-, and 30-min values were 0.987, 0.972, and 0.950 
respectively. Based on these results, the 10-min interval was selected for measurement, and it 


was advantageous as it gave a useable amount of signal for a short experimental time (Fig. 26). 
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Figure 26. Optimization of standard curve for Rapid Gold BCA Protein assay kit. The Y-axis shows the absorbance, and 


X-axis corresponds to the protein concentration (g/mL). The figure was generated by using Prism GraphPad Software 
(version 9.1.1) for macOS, San Diego, California. 


The deciding factor for successful proteomic analysis was the quantity of proteins after 
peptide digestion. Before moving on to the experimental part with breath-derived exosomes, 
the attempt to increase the number of proteins recovery after their isolation and peptide 
digestion was first tested on cell-derived exosomes. First, fractionated exosomes were collected 
as described in 2.4.4, and the proteins were quantified. During protein precipitation, new 
challenges were discovered. Unexpected loss of over 50% of proteins between the initial 
protein quantification and final peptide quantification occurred. This most likely occurred at 
the final step after tryptic digestion and purification using C18 reverse phase tips of peptide 
mixture (Fig. 27 A). 
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Figure 27. Protein quantification on purified proteins from SEC-based isolated cell-derived exosomes. The graphs 
showing (A) protein quantification on purified proteins from cell (A549 and WI-38)-derived exosomes before and after tryptic 
digestion for individual fractions 7-10 апа (B) difference in protein quantification before апа afier sample concentration 
showing on A549 cell-derived exosomes. The Y-axis corresponds to the protein concentration (ug/mL). The bar graph was 
constructed by plotting the mean of three biological replicates performed in three technical replicates and repeated three 
times. Error bars represent the standard deviation. The figure was generated by using Prism GraphPad Software (version 
9.1.1) for macOS, San Diego, California. 


The peptide quantification after tryptic digestion varied from 13 to 15 pg/mL that is 
almost 3-4 times less that the initial concentration of purified proteins. Although combining 
several individual fractions and concentrating the volume of digested proteins after elution 
noticeably increased the final concentration of protein in the sample (Fig. 27 B) that still was 
not sufficient to perform a high-quality mass-spectrometry analysis to identify proteins. To 
provide further sample, the number of cells was increased toward the more concentrated CCM 
enriched in exosomes (Appendix, Table S 1) and proteins from the exosomes were combined. 
Based on protein quantification data it was determined that at least 3.4x 107 cells were needed 
to obtain sufficient protein for mass spectrometric analysis, providing in excess of 100 mL of 
CCM. The increased volume of CCM from 34 mL to over 100 mL forced to a change of 
isolation method from SEC to UC, as UC is more convenient to operate with larger volumes. 


From this point, all exosomes were isolated by UC method as described in 2.4.3 (Fig. 28). 
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Figure 28. Cell (WI-38)-derived exosomes and their cellular content isolated Бу UC method. On the figure: cell-derived 


exosome pellets after (A) 1“ round of UC isolation and (B) 2"4 round of UC; (С) lyzed cell-derived exosomes with 8 M urea; 
(D) acetone-precipitated exosome protein. 


After finalizing the preferred method for exosome isolations, the proteins extracted from 
exosomes isolated by UC were quantified. As discussed in 3.3.1, an alternative method of 
protein digestion, the gel-tube method described in 3.2.5, was applied. Unexpectedly, the 
peptide quantification after protein digestion by the gel-tube method suggested a higher 


quantity of proteins was present after elution than was introduced to the gel-tube (Fig. 29). 
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Figure 29. Protein quantification оп purified proteins from UC-based isolated cell-derived exosomes. The graphs are 
showing protein quantification on purified proteins from cell (A549 and WI-38)-derived exosomes isolated by UC before and 
after tryptic digestion by (A) traditional in-solution digestion and (B) difference in protein quantification after tryptic 
digestion and gel-tube digestion. The Y-axis corresponds to the protein concentration (ug/mL). The bar graphs was 
constructed by plotting the mean of three biological replicates performed in three technical replicates and repeated three 


times. Error bars represent the standard deviation. The figure was generated by using Prism GraphPad Software (version 
9.1.1) for macOS, San Diego, California. 
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Based on the results reported in Figure 29, the final quantity of proteins after digestion 
by the gel-tube method was three times higher than proteins digested by traditional method. 
Further investigation of both methods by MALDI confirmed that more peaks in a mass range 
corresponding to peptide masses were observed after the gel-tube digestion method than after 


traditional digestion in-solution method (Fig. 30). 
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Figure 30. MALDI-TOF spectra of cell-derived exosomes. The figure is a representative of results of protein masses after 
purification with traditional method (in-solution digestion) for (A) WI-38 cell-derived exosomes and gel-tube method for 


cell-derived exosomes (B) WI-38 and (C) A549. The Y-axis corresponds to the signal intensity and the X-axis corresponds 
to mass (m/z). 


Surprisingly, after conducting LC-MS/MS analysis, no proteins could be identified from 
the samples processed by the gel-tube method. This suggested that substantial work would be 
required to optimize gel-tube digestion, or that it is not applicable for this type of biological 
sample (Fig. 31). 
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Figure 31. LC-MS/MS chromatogram of cell (A549)-derived exosomes digested by two different methods. The figure is 
showing LC-MS/MS chromatogram of A549 cell-derived exosomes (A) digested by gel-tube and (B) digested in-solution, where 
Y-axis corresponds to relative abundance of the detected ions, and X-axis corresponds to time. 
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Ultimately, it was decided to follow the standard, in-solution digestion, method with 
highly concentrated protein samples. Based on the reported protein quantity data from cell- 
derived exosomes, it was concluded that it is safe to apply the same approach on breath-derived 
exosomes. Peptide quantification was undertaken after digestion for all samples including 


exosomes and whole proteomes (Fig. 32). 
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Figure 32. Total protein concentration in the sample after digestion. Total protein (ug) after protein digestion for all 
exosome samples (cell- and breath-derived), whole proteome of A549 and WI-38 cell lines, and whole breath. The Y-axis 
shows the total protein (ug). The error bars represent the standard deviation of the biological replicates (n—3). The figure 
was generated by using Prism GraphPad Software (version 9.1.1) for macOS, San Diego, California. 


For the mass spectrometric analysis, three biological samples were prepared for each 
exosome and whole proteome sample. Each sample was run in 2 technical replicates and then 
analyzed by using Proteome Discoverer software v2.4 (ThermoFisher) where the data was 
searched against the entire SwissProt human protein database (TAXID=9606, v2019). The full 
list of proteins identified in each cell-derived exosome sample and whole proteome can be 
found in the Appendix (Tables S2-S5). The proteins identified as present from A549 and 
control WI-38 cell-derived exosomes were compared, as shown in Table 5 and Figure 33 


generated by using BioVenn software???. 


Table 5. Statistical data of proteins identified in cell-derived exosomes. The number of proteins identified in exosome samples 
from A549 and WI-38 cells showing total, unique, and overlapped proteins across two samples. Each group consisted of three 
biological replicate samples run in technical duplicates. 


Sample Total proteins identified Unique proteins Total overlap proteins 
A549 306 143 (30.82%) 
WI-38 321 158 (34.05) 163 (35.13%) 
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Figure 33. Area-proportional Venn diagram for proteins identified in cell-derived exosomes. Comparison diagram of 
proteins identified in lung cancer (4549) and healthy lung tissue (WI-38) cell-derived exosomes. Each group consisted of 
three biological replicate samples run in duplicate. Generated by using BioVenn software. 


The number of identified proteins in each group was very similar, as might be expected 
given samples contained equivalent protein abundances, starting from the equivalent cell 
counts and amount of CCM, as well as peptide abundance loaded onto the mass spectrometer. 
More than half of the proteins in each sample were in common between exosomes isolated 
from the two different cell sources. 

Gene ontology (GO) functional enrichment analysis was applied to proteins that were 
found in triplicate for exosome samples from each cell line (Table 6, Fig. 34). The 


bioinformatics analysis undertaken using ShinyGO v.0.61software?'?. 


Table 6. Enrichment analysis of proteins identified in cell-derived exosomes. The table shows analysis of proteins identified 
in A549 and WI-38 cell-derived exosomes based on hypergeometric distribution followed by FDR correction showing top 
three cellular components, molecular functions, and biological process. 


Cellular components 
A549 Руа WI-38 Pagj 
Extracellular exosome 4.50E-168 Extracellular exosome 7.71Е-206 
Extracellular vesicle 1.69E-165 Extracellular vesicle 5.30E-203 
Extracellular organelle 2.10E-165 Extracellular organelle 7.13E-203 
Molecular function 
Cell adhesion molecule binding 1.23E-44 Structural molecule activity 2.59E-42 
Structural molecule activity 5.18E-39 Extracellular matrix structural constituent 1.01Е-41 
Protein-containing complex binding 3.70E-26 Cell adhesion molecule binding 1.37E-35 


Biological process 


Establishment of localization 9.78E-30 Regulated exocytosis 1.04E-36 
Transport 4.97E-29 Exocytosis 7.40E-35 
Establishment of localization in cell 1.66E-27 Extracellular matrix organization 1.13E-32 
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Figure 34. Enrichment analysis of proteins identified in cell-derived exosomes. The figure is showing expression differences 
that are consistent across sets of genes identified in (A, B, and C) A549 and (C, D, and E) WI-38 cell-derived exosomes based 
on hypergeometric distribution followed by FDR correction showing (A, D) cellular components, (B, E) molecular functions 
(МЕ), апа (C, F) biological process (BP); p -value cut-off (FDR, Benjamini and Hochberg *!’) —0.05. Two nodes are 
connected if they share as least 20% or more genes. Less transparent nodes are more significantly enriched gene sets. Bigger 
nodes represent larger gene sets. Thicker edges represent more overlapped genes. 
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Notably, the three most significant terms in cellular components for both study groups 
are the same and following the same order: extracellular organelle, extracellular vesicles, and 
extracellular exosomes, which confirms that analysis was indeed conducted on exosomes and 
not any other subtypes of EVs. The molecular function ontologies associated with the two cell 
lines were similar in components but with different relative enrichments for structural molecule 
activity and cell adhesion molecule binding. As expected, the biological function ontologies 
were completely different, which can be explained by the nature of the two cell lines. While 
the vesicle-mediated transport BP ontology was strongly enriched with the lung cancer 
mimicking A549 cells, terms related to extracellular structure organization were most enriched 
for exosomes isolated from WI-38 cells, mimicking normal lung tissue. The full list of GO 
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analysis for both cell-derived exosomes undertaken using g:Profiler software?! can be found 


in Appendix, Tables S 6 and S 7. 


3.3.3 Selection of lung cancer biomarkers 


The search for potential biomarkers for lung cancer began with the 143 proteins that were 
unique to exosomes from A549 cells. Briefly, proteins that were found at least three times were 
selected as main candidates and proteins found at least twice were also selected. Then, selection 
of candidates was continuing based on the sum of Posterior Error Probability (PEP) score for 
each protein in each replicate. Based on PEP score, the raw list of proteins was narrowed down 
from 143 proteins to 22 proteins (Appendix, Table S 8) that were found twice and only 11 
unique proteins that were found at least three times in A549 cell-derived exosomes but were 
not in any WI-38 cell-derived exosomes. After validation of exosomes through the functional 
GO enrichment analysis, the identified proteins from A549 cell-derived exosomes were 
checked to identify whether they had been previously identified as cancer markers in literature. 
The preliminary list of nine potential biomarkers was further refined based on the antibody 
availability and on the strength of literature evidence to support application as a prognostic 
marker, ideally in lung cancer, and likely abundance in exosome samples (Table 7). The 
availability of antibodies against the potential biomarker was also considered, as these will be 


required for the FRET-based assay design that is described in Chapter 4. 
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Table 7. Preliminary list of biomarker candidates. Тһе list of biomarker candidates based оп the number of replicates each 
were found in A549 cell-derived exosome (from a total of 3 experimental replicates), sum of PEP score, and types of cancer 
in which the protein has been previously identified. 


Protein Description Gene # of Sum Cancers in which the protein has 
accession code replicates of PEP been previously identified. 
number score 
Q9NZMI Myoferlin MYOF 3 38.13 Bladder, colorectal, ovarian, prostate 
P11021 78 kDa glucose-regulated protein НЅРА5 3 394 Bladder, breast, colorectal, liver, 
ovarian, prostate 
P02774 vitamin D-binding protein GC 3 3211 Colorectal, prostate 
P01116 GTPase KRAS KRAS 3 1.9 Colon, breast, liver, bladder, lung 
P48509 CDI51 antigen CDI51 3 1.678 Prostate, breast, colon, NSLC, 
pancreatic 
P15151-1 Poliovirus receptor PVR 3 1.97 Colorectal, breast, prostate, lung, liver 
P11166 Solute carrier family 2, SLC2A 3 5.35 Colorectal, breast, prostate, lung, liver 
facilitated glucose transporter / 
member 1 GLUT 
P08134 Rho-related GTP-binding protein RHOC 3 5.5 Colorectal, breast, prostate, lung, liver 
RhoC 
P60953 Cell division control protein 42 CDC42 3 5.864 Colorectal, breast, prostate, lung, liver 
homolog 


From this, four candidate protein-biomarkers were prioritized: CD151 antigen (CD151), 
solute carrier family 2-facilitated glucose transporter member | (SLC2A1, also known as 
GLUT), myoferlin (MYOF), and Rho-related GTP-binding protein RhoC (RhoC) (Table 8). 


Table 8. Literature review of selected biomarkers identified in A549 cells and their association with cancer origin. 


Gene code Cancer origin Gene code Cancer origin 

CDI51 Colorectal 5'* MYOF Breast 519, 520 
Liver 521,522 Colon 525 
Lung 524 Lung 525, 526 
Melanoma 527 Pancreatic 528 
Ovarian 529 RhoC Bladder 530 
Pancreatic 5! Breast 532 536 
Prostate 537 Gastric 555 559 

SLC2A1 (GLUT) Breast 20 Liver 54! 
Colon 542 Lung 543, 544 
Gastric 545546 Ovarian 547 
Lung 48552 Pancreatic 553 

Renal 554 


CDI51 enhances the motility, invasion and metastasis of cancer cells^*. GLUT is 
responsible for increased cellular glucose uptake and increased aerobic glycolysis in recipient 
tumor cells. MYOF plays a role in the plasmalemma repair mechanism of endothelial cells that 
permits rapid resealing of membranes disrupted by mechanical stress and is involved in 


endocytic recycling. Finally, RhoC regulates a signal transduction pathway linking plasma 
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membrane receptors to the assembly of focal adhesions and actin stress fibers and serves as a 
microtubule-dependent signal that is required for the myosin contractile ring formation during 
cell cycle cytokinesis. 

Some proteins (CDC42, KRAS, and PVR) in Table 8 had a higher score than the chosen 
biomarker candidates but were not selected for the final round of candidates. All three proteins 
are generally considered for cancer therapy rather than diagnostics. For example, CDC42 is 
most often mutated in cancer and could be overexpressed via signalling through oncogenic 
receptor and so not considered as a diagnostic marker555. KRAS is also considered as mutated 
oncogene and rather highlights as high-priority therapeutic target556557_ Finally, although RVP 
is highly expressed in several types of cancer including lung cancer, but it is more common to 
find it in ovarian and colorectal cancer. In addition, PVR is gaining more scientific interest as 
therapeutic target??*. That being said, these proteins can be considered as a back-up option to 
the selected biomarkers and could be considered alongside the four selected biomarkers in 
future clinical studies involving lung cancer patients. 

These proposed biomarkers were validated through western blotting. The selectivity of 
each marker between exosomes from A549 and WI-38 cells was analyzed individually in 
triplicate (Appendix, Figs. S 1- S 10), as well as through a combined cocktail of markers (Fig. 
35). 
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Figure 35. Validation of proposed biomarkers CD151, GLUT, MYOF, and RhoC. Western blot showing CD151, GLUT, 
MYOF, and RhoC blotting on A549 exosome lysate апа cell (whole proteome) lysate. Note, that markers are only shown in 
cancer cell-derived exosomes and cancer cell model lysates and absent in WI-38 cell- and breath-derived exosomes. Western 
blotting of general tetraspanin EV markers (Alix, CD63, and CD81) are also shown. 
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According to the western blot results, (һе proposed lung сапсег protein biomarkers were 
only identified in A549 cell-derived exosomes and its whole proteome, which was expected. 
WI-38 cell-derived exosomes, WI-38 whole proteome, and breath-derived exosomes, playing 
the role of controls, did not show any of the proposed lung cancer markers, verifying that 
proposed biomarkers are able to distinguish between cancer and healthy lung exosomes. As 
exosomes are time consuming to isolate, it was useful to know whether markers on exosomes 
could be identified from within whole breath condensate. Therefore, exosome samples, 
including those derived from a sample of human breath, were compared to whole breath using 


general tetraspanin EV markers (Fig. 36). 
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Figure 36. Validation of exosomes by western blotting. The cell (4549, WI-38)- and breath-derived exosomes were validated 


using general tetraspanin EV markers (Alix, CD63, and CD81) in comparison to whole breath and cellular (WI-38/A549) 
whole proteome. 


Disappointingly, the general tetraspanin EV markers (Alix, CD63, and CD81) could not 
be identified in whole breath, although they were apparent in all purified exosome samples. 
This may indicate that it will be necessary to go through at least a step of exosome enrichment 


as part of any future exosome biomarker-based diagnostic device. 
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3.3.4 Characterization of cellular whole proteome 


The whole proteome from the A549 and WI-38 cells was also compared. The analysis 
reported that 646 proteins were identified as common proteins across case and control study. 
The unique proteins identified in WI-38 whole proteome will be later compared with proteins 
identified in whole breath of volunteers participated in the determination of benchmark breath 


proteome. The full list of proteins can be seen in Appendix, Tables 5 4 and 5 5. 


3.3.5 Characterization of breath-derived exosomes (preliminary studies) 


After collection of exhaled breath condensate, breath-derived exosomes were isolated as 
described in 2.4.3 (Fig. 37). 


Figure 37. Characterization of breath-derived exosomes. On the figure: (A) washed pelleted breath-derived exosomes by 
UC method and (B) acetone precipitated proteins from the same sample. 


Based on the optimized protocol for increasing protein concentration after digestion, 
samples (x20) were pooled together in one. Three biological replicate samples were prepared 
to go through identification analysis. A 1 ul aliquot from each digested protein sample was 
combined with 1 ul of a-cyano-4-hydroxycinnamic acid matrix and air-dried prior to analysis 
by MALDI-TOF-MS described in 3.2.6. Then samples were subjected to MS analysis, after 
which it has been noted that the chromatogram from some samples had high signal intensity at 
longer retention times. It was presumed that this arose from lipid contamination. It seemed that 


the lipid: protein content ratio was higher than in cell-derived exosomes. Samples (Fig. 38) 
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then were washed with ethyl acetate as described іп 3.2.9 and were re-probed again Бу using 
MALDI-TOF-MS as described above. 
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Figure 38. MALDI-TOF spectra of breath-derived exosomes. The spectra are representative of sample (A) before and (B) 
after an ethyl acetate wash. Highlighted peaks circled before wash and disappeared afier the wash. The Y-axis corresponds 
to the signal intensity and the X-axis corresponds to mass (m/z). 


Based on the spectrum, large peaks that were previously noticed around 1600 m/z have 
noticeably decreased. Comparing the results of MALDI spectrum before (Fig. 38 A) and after 
(Fig. 38 B) it was clear that the contamination is much reduced by the ethyl acetate wash. 

Breath-derived exosome samples named as ЕІ, E2, апа ЕЗ and were subjected to MS 
analysis. Samples were run in two technical duplicates to make sure that the injection volume 
is sufficient for protein identification. The injection volumes were adjusted to deliver consistent 
peptide amounts to the mass spectrometric analysis (Appendix, Table S 9). Statistical data 


collected after analyzing samples through PD software is shown on Table 9. 


Table 9. Statistical data of proteins identified in breath-derived exosomes. The number of proteins identified in exosome 
samples from the same individual showing total, unique, and overlapped proteins across three samples. Each group consisted 
of three biological replicate samples run in technical duplicates. 


Sample Total proteins Unique proteins Shared proteins Overlap proteins 
identified across 3 samples 
El 504 181(21.22%) 34 (3.99%) (Е1+Е2) 
Е2 575 176 (20.63%) 135(15.83%) (E2+E3) 230 (26.96%) 
ЕЗ 462 38(4.45%) 59 (6.92%) (Е1--ЕЗ) 
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Тһе similarities апа differences between samples is shown Бу а Venn diagram іп Figure 


39, generated by using BioVenn software5!3. 


Figure 39. Area-proportional Venn diagram for proteins identified in breath-derived exosome samples from the same 
individual. Comparison diagram of proteins identified in breath-derived exosomes from the same individual across 3 
biological samples (E1, E2, and E3). Each biological replicate sample run in duplicate. Generated by using BioVenn software. 


To support the fact that the number of total proteins identified in each sample was slightly 
different, the chromatogram from each sample was analyzed (Fig. 40). These differences 


between the samples from the same individual may relate to the fact that they were collected 


at different times. 
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Figure 40. Chromatogram of breath-derived exosome samples. The chromatogram represents three biological samples: (А) 
El, (B) E2, and (C) ЕЗ. 
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3.3.6 Characterization of whole breath (preliminary studies) 


To define if there are any common proteins between EBC and breath exosomes, the whole 
breath proteome was also obtained as described in 3.2.2. The total number of proteins identified 


are summarized in Table 10 and Figure 41. 


Table 10. Statistical data of proteins identified in whole breath. The number of proteins identified in whole breath samples 
from the same individual showing total, unique, and overlapped proteins across three samples. Each group consisted of three 
biological replicate samples run in technical duplicates. 


Sample Total proteins Unique proteins Shared proteins Overlap proteins 
identified across 3 samples 

УҮНВІ 518 124 (17.46%) 107 (15.07%) (WHB1+WHB2) 

WHB2 461 80 (11.27%) 18 (2.54%) (WHB2+WHB3) 256 (36.06%) 

WHB3 399 94 (13.24%) 31 (4.37%) (WHB1+WHB3) 


Figure 41. Area-proportional Venn diagram for proteins identified in whole breath samples from the same individual. 
Comparison diagram of proteins identified in whole breath from the same individual across 3 biological samples (WHB1, 
WHB2, and WHB3). Each biological replicate sample run in duplicate. Generated by using BioVenn software. 


According to the data analysis, 709 proteins were identified at least once, and 410 
proteins were identified at least twice. 

Finally, a comparison was made between proteins found in breath exosomes and those 
found in the whole breath sample (Table 11, Fig. 42). 
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Table 11. Statistical data of proteins identified in combined samples of breath-derived exosomes and combined samples of 
whole breath from the same individual. The number of proteins identified in combined individual samples of breath-derived 
exosomes and whole breath samples from the same individual showing total, unique, and overlapped proteins across two 
samples. Each group consisted of three biological replicate samples run in technical duplicates. 


Sample Total proteins identified Unique proteins Total overlap proteins 
E combined 786 503 (43.78%) 


283 (24.63%) 


WHB combined 646 363 (31.59%) 


Figure 42. Area-proportional Venn diagram for proteins identified in combined individual samples of breath-derived 
exosomes and whole breath samples from the same individual. Comparison diagram of proteins identified in all combined 
samples of breath-derived exosomes (E combined) and whole breath (WHB combined) from the same individual. Each 
biological replicate sample run in duplicate. Generated by using BioVenn software. 


This preliminary study of breath-derived exosomes and whole breath was conducted on 
samples collected from a single individual. It allowed methodology to be optimized ready for 
application on creating the benchmark proteome of breath exosomes from a healthy population. 
As the data represents only a single individual, a detailed analysis of the proteins identified was 
not undertaken. Results of the larger study on exosomes collected from the breath of healthy 


volunteers’ study will be presented in next section. 
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3.3.7 Determination of the benchmark proteome of breath exosomes 


Breath-derived exosomes from EBC were isolated by ultracentrifugation and assessed by 


western blotting with general tetraspanin EV markers (Fig. 43 А) and cryo- SEM (Fig. 43 В). 
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Figure 43. Characterization of breath-derived exosomes by western blot analysis and Cryo-SEM. The validation of breath- 
derived exosomes was confirmed by (A) western blotting using general tetraspanin EV markers-Alix, CD63, and CD81. (B) 
The size and morphology of exosomes were validated by cryo-SEM imaging. Scale bar 100 nm . 


As described in sections 2.3.4 and 3.2.10, isolated exosomes were grouped into three 
samples that were individually subjected to proteomic analysis. A total of 730 unique proteins 
were identified (Appendix, Table S 10). As shown in Table 12, there was a large difference in 


the number of proteins identified between samples. 


Table 12. Statistical data of proteins identified in breath-derived exosomes collected from volunteers. The number of proteins 
identified in exosome samples from volunteers showing total, unique, and overlapped proteins across three samples. Each 
group consisted of three biological replicate samples run in technical duplicates. 


Sample Total proteins Unique proteins Shared proteins Overlap proteins 
identified across 3 samples 
51 233 62 (8.53%) 41 (5.64%) (S1+S2) 
82 610 321 (44.15%) 126 (17.33%) (S243) 122 (16.78%) 
S3 303 47 (6.46%) 8 (1.1%) (51-53) 


This may arise as these are low abundance samples, with many peptide signals being 


around the limits of detection of the mass spectrometer. 
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From 730 unique proteins, 297 proteins were found in at least twice across three samples 
as shown in Figure 44 (Appendix, Table S 11). These 297 proteins were defined as the core of 


the breath exosome proteome (BEP). 


Figure 44. Area-proportional Venn diagram for proteins identified in breath-derived exosome samples from volunteers. 
Comparison diagram of proteins identified in breath-derived exosomes from volunteers across 3 biological samples (S1, S2, 
апа 83). Each biological replicate sample run in duplicate. Generated by using BioVenn software. 


The functional gene ontology (GO) enrichment analysis of the core BEP was performed 
using g:Profiler database?! for functional interpretation of gene list. Although the core of the 
BEP was identified consisting of 297 proteins, 5 proteins (Immunoglobulin lambda constant 7 
(A0M8Q6), Immunoglobulin kappa constant (Р018341), Immunoglobulin heavy constant 
gamma 3 (P01860), IgGFc-binding protein (Q9Y6R7), and Immunoglobulin heavy variable 
3/OR16-9 (non-functional) (S4R460)) were not included in the analysis as they were absent 
from the g:Profiler database. The GO enrichment analysis revealed that of the 292 proteins, 
235 have a GO cellular component annotation to the extracellular exosome, being the most 
strongly enriched term in this analysis (Раа 3.43x10:!66), The top three GO biological process 
annotations are immune effector process (132 terms, Pag; 2.50x1075), leukocyte mediated 
immunity (120 terms, Раа; 1.56x1077), and regulated exocytosis (113 terms, Раа) 5.05х 10-75). 


GO molecular function and cellular components are shown on Figures 45 B and C respectively. 
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Figure 45. The functional GO enrichment analysis of the core of the breath exosome proteome. Results of functional 
enrichment analysis to examine the (A) biological process, (B) molecular function, and (C) cellular component of the core 
BEP proteins. The subset of the core BEP consisting of 184 proteins (GO: BP) is shown under D. The Y-axis represents term, 
and X-axis represents negative logo of adjusted p-value. The figure was generated by using Prism GraphPad Software 
(version 9.1.1) for macOS, San Diego, California. 


Among these proteins, identified as the core of the breath exosome proteome, a few of 
them were associated with exocytosis. There were considerable overlaps between proteins 
annotated to many of the biological process ontologies. As exosomes are formed through a 
regulated exocytosis pathway, we were particularly concerned that some biological processes 
may have been reported as significantly enriched only because they shared proteins with the 
regulated exocytosis ontology. Removing the exocytosis-associated proteins removed those 
other apparent enrichments, revealing more relevant biological processes associated with the 
identified proteins. For that reason, the list of 297 proteins were narrowed down and a follow- 
up GO functional enrichment analysis was completed with the 184-protein subset (Appendix, 
Table $12) of the core BEP proteins, those not annotated to the regulated exocytosis term. 
Within this subset of proteins, the most enriched cellular component ontology is extracellular 
space (158 terms, Paaj 3.94x 105?) with the most significant parental biological processes being 
humoral immune response (41 terms, Paaj 6.36х 107%), humoral immune response mediated by 
circulating immunoglobulin (29 terms, Pa 2.21 x 10??), and complement activation (26 terms, 
Paaj 5.11x107?), A full list of GO term enrichments undertaken using g:Profiler database?! is 
in the Appendix, Tables S 13-14. Although the purification gave high quality exosomes free of 


protein contamination, there is a contribution from proteins associated with the salivary system. 


107 


Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis using KEGG 
database??? for both the core BEP and the 184-protein subset revealed the contribution of the 


salivary system, with 12 proteins annotated to salivary secretion in the core BEP (Fig. 46, 
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Figure 46. KEGG pathway of proteins associated with salivary secretion. The proteins activated in salivary secretion are 
highlighted in red boxes. The figure is generated by using ShinyGO v.0.61 software. 


Among those 12 proteins, 10 proteins were found in the 184-protein subset excluding 
alpha-amylase 1А (АМҮ) and cathelicidin (LL-37). However, this number may be higher as 
our pathway analysis only covered 179 of the core BEP proteins and 97 of the smaller subset. 
Among those salivary proteins, four major protein families (cystatins (CST1, CST2, CST4, 
CST5), histatin (HTNI, HTN3), mucin (MUCSB), and statherin (STATH)) were found. The 
proteins from the first two families can be found in saliva and have been produced by parotid 
and submandibular glands, which are the most important glands responsible for saliva 
production?9), In addition, mucin is thought contribute to the lubricating and viscoelastic 


561 


properties of whole saliva??!. Statherin plays a role in stabilizing saliva supersaturated with 


calcium salts by inhibiting the precipitation of calcium phosphate salts562. 
To fully understand of the biological processes associated with the proteome of breath 
exosomes, the STRING database?!^ was used to visualize the protein network, as well as 


functional associations (Fig. 47). 
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Figure 47. The network of identified proteins of the соге ВЕР constructed by STRING. The proteins in the network are 
represented with high confidence with major clusters highlighted in yellow (Keratin), red (Arp2/3 protein complex), purple 
(mixed cluster including Tetraspanin family and Annexin), green including dark green (Glycolysis), and teal (RhoGTPase 
proteins involved in activation of WASPs and WAVEs and actin binding). The connections indicate that these proteins are part 
of a physical complex, and the thickness indicates the strength of the connection within the complex showing their confidence. 
Disconnected nodes from the network were removed. 


The first largest cluster is the RhoGTPase cluster, proteins of which play an important 
role in exosome biogenesis by regulating protein transport and membrane recycling from 
endosomal compartments. The second largest cluster consists of proteins associated with 
glycolysis and the pentose phosphate pathway. Both complexes are involved in the glucose 
metabolism?9^, Further, the pentose phosphate pathway protein complex is a major key in 


5 


exosome-mediated intracellular communication?9. It is also important to mention that 


simultaneous inhibition of the glycolysis and oxidative phosphorylation (not shown) triggers 


109 


increase of exosomes secretion?9, Moreover, glycolytic enzymes are involved іп ATP 
production that can be easily affected during cancer progression due to a decrease of ATPase 
activity??". The third largest cluster of proteins is associated with the tetraspanin family and 
annexin, proteins of which are concentrated on the plasma membrane of EVs and play a 
fundamental role in EV biogenesis’. The fourth protein cluster consists of the Arp2/3 protein 
complex, which is shared with the RhoGTPase cluster described above. The ARP2/3 complex 
is involved in the formation of actin branches. Interestingly, this complex has been previously 


569,570 and EVs release?"'. The keratin complex 


reported as one ofthe key players in endocytosis 
has been also highlighted in the network of identified proteins as a major constituent of proteins 
found in the breath-derived exosomes. 

Finding similarities and differences that exist among different types of exosomes piqued 
our interest, and comparison was made with literature data of the urinary exosome proteome, 
a source chosen because it is also collected by a non-invasive approach. Identification and 
proteomic profiling of exosomes in human urine has been studied for decades resulting in a 


database of 1160 proteins?07572 


. First, proteins identified in the core of the breath exosome 
proteome and urine-derived exosomes were compared to derive the profile similarities (Fig. 


48). 


Figure 48. Area-proportional Venn diagram for proteins identified in breath-derived and urinary exosomes. Comparison 
diagram of proteins identified in breath-derived exosomes from volunteers and urinary exosomes across two samples. The 
proteins identified in urinary exosomes used in this figure was taken from previous published database", Generated by 
using BioVenn software. 


It appeared that almost 40% of proteins identified in breath exosomes are also present іп 


urinary exosomes. Then, we decided to look at unique families of proteins identified in urine- 
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апа breath-derived exosomes (Fig. 49). А significant similarity would suggest that the proteins 


are related. 
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Figure 49. The functional GO enrichment analysis of urine and the core of the breath-derived exosome proteome. The 
diagram is showing only BP component of unique proteins identified in (A) urine exosomes and (B) the core of the BEP. Two 
pathways (nodes) are connected if they share 20% or more genes. Less transparent nodes are more significantly enriched 
gene sets. Bigger nodes represent larger gene sets. Thicker edges represent more overlapping genes. The figure was generated 
by ShinyGo v0.61 sofiware. 


The GO analysis of the biological component identified more similarities than expected, 
highlighting the same terms with slightly different enrichment values. The main difference 
identified among terms of the biological component was the leading term. For urine exosomes 
the most enriched biological component ontology was vesicle-mediated transport (276 terms, 
Раа 6.57X 10), while for the core of breath exosomes it was immune effector process (132 
terms, Pag 2.50x1075). The analysis of shared proteins identified five members of the 
component of the chaperonin-containing T-complex that plays role in regulating transport 
vesicles to cilia and actin folding??. The analysis of molecular function identified two 
similarities: the groups of proteins associated with "cell adhesion molecule binding" and 


"cadherin binding". 
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3.3.8 Other co-isolated vesicles 


During TEM imaging of breath-derived exosomes, some larger vesicles were co-isolated 
with exosomes. Based on their size range (200-500nm) and shape, they potentially could be 


considered as microvesicles or larger extracellular vesicles (Fig. 50 A and B). 


Figure 50. Characterization of larger EVs by TEM and STEM-EDS. TEM images of larger EVs (A-B) eluted with exosomes 
fraction after ultracentrifugation ;scale bar 500 nm. (С) The STEM-EDS analysis of a single vesicle found in breath (1) 
carbon, (2) nitrogen, (3) phosphorus, and (4) uranium. Scale bar 250 nm. 


STEM-EDS was performed to distinguish their origin. The results are shown in Figure 
50 C. The sample was placed on a copper grid and stained with uranyl acetate, explaining the 
presence of these otherwise unexpected elements in the EDS spectrum. The presence of 
abundant carbon and oxygen that maps as co-located with larger vesicles is consistent with it 
being of organic origin. At this point no further investigation was done as it was not the aim of 


this study. However, the findings were noted for further studies. 


3.3.9 Comparison of cell- and breath-derived exosomes 


To consider whether the exosomes derived from cells make a good model of breath 
exosomes, and to determine the nature of differences between them, the proteins identified 


from these two sources compared with each other (Fig. 51). 
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Figure 51. Area-proportional Venn diagram for proteins identified in cell- and the core of the breath-derived exosome 
proteome from volunteers. Comparison diagram of proteins identified in cell (A549 and WI-38)-derived exosome and the core 
of the breath-derived exosome proteome from volunteers (S). Each biological replicate sample run in duplicate. Generated by 
using BioVenn software. 

Of the 297 proteins identified as the core of the breath exosomes proteome, relatively 
few are found in common with the exosomes identified from the cell models. It is likely that 
with larger samples and repeated proteome determination, the apparent disparity between cell 
and breath exosomes will decrease. However, the difference between the two sample types is 
also likely to reflect the range of cell and tissue types contributing to breath exosomes, which 


will be much wider than the lung cells represented by the A549 and WI-38 models. 


3.5 DISCUSSION 


Lung cancer is the leading cause of cancer death205:574. The frequency of diagnosis of 
early stages when curative treatment is possible is very low compared to the more advanced 
stages??. This is because early stages are asymptomatic and, additionally, the current 
diagnostic methods discussed in 1.3.2 are unable to detect the smallest tumors. To date, there 
are no biomarkers or biomarker-based screening assays that can accurately predict lung cancer 
development in early stages. Therefore, there is clearly a high need for the identification of 
biomarkers and the development of new analytical tools that can reduce the cancer burden 
through its early detection. For this reason, this study proposes the understanding of the breath 
exosomes collected from EBC through evaluation of their protein cargo. 

Results from this chapter identified potential lung cancer biomarkers are only found in 


exosomes derived from A549 cells, used as a model of lung cancer. They were also in the 
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whole proteome of A549 cells and were absent in negative controls (the WI-38 cell model of 
normal lung and breath-derived exosomes from healthy volunteers). 

It was first necessary to increase the amount of protein within analyzed samples. That 
includes detailed analysis of each step during protein extraction and troubleshooting for the 
optimization of the process. Among those, selection of an appropriate buffer solution for 
protein extraction, the volume of starting solution from which exosomes will be extracted, and 
the method of protein digestion were highlighted. The following issues were also identified in 
the literature review as discussed in 3.1.2. The selection of an appropriate buffer solution 
played an important role of separating proteins from other non-relative cellular components, 
such as lipids, by protect the integrity of proteins. It was important to find out which lysis 
buffer is most compatible with the exosome proteins. In previously reported studies the choice 


55 а combination of urea and 


of lysis buffer varied, most commonly used lysis buffers are urea’ 
thiourea?75, КІРА buffer, NP-4057, and guanidium chloride. The choice of the buffer will 
always be dictated by the nature of the sample from which proteins will be extracted. Use of 
thiourea provides a potent denaturant that can improve the solvation efficiency and so gives a 
high yield of proteins. However, this buffer is more applicable to use in gel-digestion method 
rather than in solution??. Two comparative studies on the relative benefits of buffers such as 
NP-40, SDS, 8 M urea??? (1* study) and ВІРА, SDS, 8 M urea š! (214 study) have concluded 
that 8 M urea is the most generally reliable chaotropic agent in such lysis buffers. Based on 
these reports, the most common buffers (NP-40, 8 M urea, and combination of urea/thiourea) 
were also tested on exosome samples reporting 8 M urea buffer as a preferable choice. 

The complication with the volume of starting solution from which exosomes were 
extracted was resolved through gradually increasing the cell count until the ideal number of 
total cells was found and through selection of the most suitable lysis buffer. However, that only 
partially solved the challenge. With breath-derived exosomes, it was not possible to apply the 
same methodological approach. At this stage, EBC was collected at different volumes and 
proteins were quantified each time until it reached the number close to the number of proteins 
identified from cell-derived exosomes. That being said, the volume of the starting solution 
containing exosomes was not the only factor responsible for the low protein recovery. 

The discovery of the new putative biomarkers in this study was adopted and pursued 
through the analysis of protein cargo of the case (A549) and control (WI-38) cell-derived 
exosomes that then was compared to protein set of breath-derived exosomes. As a result of this 
comparison, the identified biomarkers were able evaluate the correlation between two protein 


sets contributed to distinguishing lung dysfunctionality from healthy lungs. Along with this, 
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the benchmark proteome of exosomes isolated from the breath of healthy volunteers was fully 
characterized for the first time. It must be noted that LC-MS/MS analysis has its own limitation 
in detection of low-abundance proteins that still remains challenging. The results of this study 
suggest that the reliability of the data obtained from a sample will depend on the quality of the 
sample preparation. We cannot forget the fact that exosomes are expected to be rich in lipid- 
associated proteins, which may be inherently difficult to resolubilize after acetone 
precipitation. Further, the peptides obtained from tryptic digestion may be similarly 
hydrophobic and readily lost through precipitation or retention on the C18 micropipettes. 

Analternative method of protein digestion, the gel-tube method, was proposed and tested. 
The gel-tube system sets a sample in acrylamide followed by polymerization. The sample is 
washed in the gel, and so is never precipitated. Tryptic digestion was undertaken following 
methodology used for in-gel digestion. This method is reported to improve sample recovery 
with higher efficiency through in-gel protein trapping. A study described by Lu et al582 
compared gel-tube and SDS-polyacrylamide gel electrophoresis methods, reporting that gel- 
tube provides a better sequence coverage for peptides from membrane proteins extracted from 
prostate cancer cells. The efficiency of the gel-tube method for characterization of membrane 
proteins was further validated in comparison to in-solution digestion583584585 Despite the 
relative ease of implementation and reported positive outcome, this method was not successful 
for breath-derived exosomes as no proteins could be identified following this procedure. After 
numerous attempts to address the issues with protein recovery, it was decided to proceed with 
the standard method (digestion in-solution) but with an increase of the sample concentration to 
avoid insufficient number of proteins that can further compromise mass spectrometry analysis. 
This approach was first used on the preliminary study of breath-derived exosomes and was 
described in 3.3.5. 

Whilst numerous protein biomarkers have been already discovered and reported from 
different sources of exosomes (Table 1), there is no information on breath exosome proteome 
that can shed light on new candidate biomarkers. Breath should be considered as an open 
avenue that offers a huge potential for a non-invasive identification of promising biomarkers. 
This study has revealed that the protein cargo of breath exosomes can be used to distinguish 
changes in protein sets between a lung cancer cell model and exosomes from healthy lung cell 
model suggesting that breath exosomes can be used to evaluate lung health becoming a serious 
competitor to plasma exosomes. To answer the question whether the proposed biomarkers can 
provide an accurate diagnosis of lung cancer, the proteome of clinical patients diagnosed with 


lung cancer, including different stages of this disease must be characterized. Results of this 
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future study can determine potential clinical usefulness for the early prediction of lung cancer 
and support breath as an assessable biological fluid for the discovery of biomarkers. 

The present study highlights two other major findings: characterization of benchmark 
proteome from breath-derived exosomes and the importance of breath proteome as a negative 
control when investigating the specificity of proposed biomarkers. Although, the focus of the 
study was on evaluation of breath exosomes defining their potential application in diagnosis, 
the possibility of using whole breath was also considered. Since we could not provide evidence 
of the presence of even standard tetraspanin EV markers in whole breath samples by LC- 
MS/MS (3.3.6) or western blotting (3.3.7) or, there was no point in attempting our exosome- 
led diagnostics on whole breath. 

Protein contamination of BEP described in 3.3.7 most likely occurred during EBC 
collection as contaminating proteins that originated from tissues and organs associated with 
respiratory tract and airways were noted. The most common contamination that has been 
previously described in several studies is associated with saliva proteins586587 Further, it has 
been reported that 10% of the “respiratory droplets” present in EBC are associated with oral 
portion of the airway, which includes tongue and salivary glands?**. Results of this study 
reported 12 proteins associated with salivary glands and 11 proteins associated with a group 
including salivary glands, lymphoid tissue58%5%) esophagus, and tongue. Lastly, the STRING 
analysis identified keratin protein complex amongst other largest protein complexes (Fig. 47). 
Fifteen proteins associated with keratin family were found in the core of BEP. Five out of 15 
proteins were associated with skin, while other 10 proteins had esophagus and tongue origin. 
This is not surprising as exogenous proteins or environmentally-derived proteins, such as 
keratins, that are present in condensate from the room and dust could also contribute 
contaminants of highly abundant ргоісіпв 591,592, 

The GO functional enrichment analysis and proteins’ network revealed insights of the 
breath proteome by highlighting several processes that are directly associated with exosomes 
biogenesis. In detail, the functional GO enrichment analysis of the core BEP revealed that 
breath-derived exosomes are indeed formed by the exosomes pathway which was confirmed 
by enriched cellular component “extracellular exosomes" (Pag 3.43 10-16%), As exosomes аге 
constantly released by all types of cell (healthy and diseased), it was important to determine 
whether a common subset of proteins can be determined across all types of exosomes 
mentioned in this study. Another question that this study tried to answer was whether there are 
cell type-specific proteins that can be found in the proteome of cellular exosomes that are 


similar or different to breath-derived exosomes°92. The presence of proteins associated with 
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cell adhesion molecule binding was shared between all types of exosomes. Interestingly, the 
cell-derived exosomes had integrin and RNA binding in their top 10, while breath-derived 
exosomes did not have either of them. Breath-derived exosomes were dominated by proteins 
associated with antigen binding. This is not surprising as exosome have been previously 
recognized as mediators of immune regulation??. Biological processes such as “vesicle- 
mediated transport", "transport", "regulated exocytosis" are common between the protein 
profiles of cell- and breath- derived exosomes and is consistent with previous studies??495, 
While “leukocyte mediated immunity" was predominant for breath exosomes, “establishment 
of localization” was leading in A549 exosomes, and “extracellular matrix organization” in WI- 
38 exosomes. Further analysis of biological processes revealed numerous similarities between 
WI-38 cell- and breath-derived exosomes, suggesting that selected model can provide a valid 
proxy. However, breath- and A549 cell-derived exosomes shared more common proteins than 
breath- with WI-38 cell-derived exosomes. This may arise from incomplete sampling of the 
proteome of these various types of exosomes, as well as from the contribution of exosomes 
from the airways other than the lung. 

In summary, the results of this study indicate that the protein cargo of cell- and breath- 
derived exosomes are somewhat similar. This level of similarity is not unusual. For example, 
two studies confirm that the protein profile of total blood exosomes resembles the protein 
profile of plasma еховотев 5%, A comparative analysis of the proteome of breath-derived 
exosomes from this study and the proteome of urine-derived exosomes from the literature was 
also undertaken and is presented in 3.3.7. This analysis showed that not only were over a third 
of the proteins found in exosomes obtained from breath shared in common with exosomes from 
urine, but also that most of the gene ontologies associated with the proteomes of exosomes 
from these two sources. That allows us to conclude that proteins identified in breath exosomes 
are somewhat similar to proteins identified in other types of exosomes isolated from different 


biological fluids as previously described in Chapter 2. 


3.5.1 Limitations 


The determination of the benchmark healthy lung exosome proteome was conducted 
using samples mainly from people in young adulthood (18 to 30 years). The median age for 


597 


lung cancer is 70 years", therefore this benchmark proteome is not appropriately age-matched 


for a future proteomic study on exosomes isolated from lung cancer patients. 
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However, the benchmark proteome is a valuable resource, being the first reported from 
exosomes isolated from breath, and has been obtained from an appropriate population for 
comparison with many other lung dysfunctions. In this study it was not possible to distinguish 
whether there are important differences between the proteins identified from male or female 
participants, as samples were pooled. Gender differences and individual variations in 
comprehensive analysis of proteome samples have been previously reported^?*599, This was 
done as the aim of the study was not to elucidate such differences, but rather to gain a broad 
overview of the proteins associated with exosomes isolated from breath, and any use of this 
data in developing diagnostic tests of lung function will ideally draw on proteins not associated 
with sex-associated differences. 

A large sample volume of exhaled breath was needed to recover enough exosomes to 
identify their protein cargo. Additional work is required to minimize the volume required to 
conduct the comprehensive analysis of exosomes, as it would be best to avoid prolonged breath 
collection from people suffering with lung dysfunction in any future matched study looking for 
biomarkers of disease. Ideally, the breath collection should be simple and fast with max 10-15 
minutes of breath donation. Any final diagnostic techniques will need to work well on much 
lower sample volumes, once biomarkers are established. In this study, only a single cell line 
was used to represent each of healthy lung and lung cancer. These cell models may not fully 
and accurately reflect the physiological condition they were chosen to represent. Validation of 
the proposed biomarkers will require different cell models. 

This study, however, did not set out to determine definitive biomarkers. Instead, the 
experiments aimed to prove that it would be possible to detect potential biomarkers in 
exosomes, and to thereafter develop an analytical tool to determine the presence or absence of 
such biomarkers, and this aim was most readily established with a simple set of cell models. 
Whether this limitation is anticipated, this was not a major problem in this study. The findings 
were relevant and represented in-depth understanding of the research goal in this chapter. 

Although all proteins found in exosomes in this study are well characterized, not all of 
them are recorded in every database used in the proteomic analysis. For example, whilst some 
proteins were excluded because they were annotated as commonly found in saliva in the 
Human Proteome Atlas (НРА), other proteins may also be contaminants in the sample 
relating to saliva but retained in the analysis as HPA had no annotation for them. Similarly, 
some protein accessions were not found in the gene ontology or KEGG pathway analysis. 
Omission or inclusion of such proteins from the analysis will have impacted in a small way on 


the proteomic analysis. However, a sufficient number of proteins were identified that the 
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impact of this should be minimal, especially ав our analysis focused оп the most strongly 
enriched terms or consistently identified protein complexes. Challenges highlighted in this 
chapter need to be taken for consideration for the future research to improve the interpretation 
and clinical usefulness of exosome-based protein biomarkers. Combining results presenting in 
this study with a future characterization of breath proteome from lung cancer diagnosed 
patients can provide comprehensive understanding whether the putative biomarkers can be 


used in diagnosis and clinical staging of lung cancer disease. 


3.6 CONCLUSION 


The first aim of this chapter was to determine the proteome of exosomes from cell models 
of normal lung tissue (WI-38 cells) and lung cancer (A549 cells) and to demonstrate that it is 
possible to identify potential biomarkers of lung cancer. After conducting proteomic analysis 
of exosomes from cell models, four putative biomarkers (GLUT, MYOF, CD151, and RhoC) 
were proposed. These proposed biomarkers were found only in exosomes isolated from the 
А549 cell model of lung cancer and were absent іп exosomes from WI-38 cells and from 
exosomes isolated from the breath of healthy human volunteers. 

The second aim of this chapter was to determine the benchmark proteome from breath- 
derived exosomes and the use of those exosomes as a negative control to test the proposed 
putative biomarkers of lung cancer. The proteome profile of breath-derived exosomes was 
identified for the first time from exosomes obtained from the breath of 55 healthy volunteers. 
Through GO functional enrichment analysis we showed that exosomes isolated from healthy 
lung cells and breath are similarly “programmed” reporting roughly similar biological 
processes, which we had expected. 

The study suggests that the proteome of breath-derived exosomes may be used in future 
studies identifying new biomarkers of lung dysfunction, with a final goal of proposing lung 
cancer biomarkers. Researchers have to be mindful that protein contamination of EBC 
specimens with salivary and other exogenous proteins could interfere with characterization of 
the exosomal proteome™!. At the end of this study, it has been determined that exosomes 
obtained from WI-38 cells could provide a valid proxy for breath-derived exosomes using 
bioinformatics techniques based on the identified proteomes. 

The next chapter will investigate whether exosome-based biomarkers can be detected 


using FRET methodology. 
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CHAPTER 4. DETECTION OF EXOSOMES - DIAGNOSTIC 
APPLICATION 


Aim: 
The aim of this chapter was to develop a sensitive test for the presence of the selected 


biomarkers of lung cancer found on exosomes from cell models. 


Objectives: 
1. Select antibodies corresponding to the proposed biomarkers, and label with fluorescent 


dyes to give suitable FRET pairs. 


2. Determine the FRET efficiency of each proposed fluorescent labeled antibody pair by 
conjugating them with all sources of exosomes with the goal of selecting 1-2 pairs using 


a FRET-based assay using time-correlated single photon counting. 


4.1 INTRODUCTION 


4.1.1 Overview of sensor detection methods 


The final component of this study was to look whether it is possible to develop a sensitive 
test for the presence of the selected biomarkers of lung cancer found on exosomes from cell 
models. This section will briefly give an overview of sensor devices and describe the 
requirements of sensors that are critical for diagnostic purposes. 

Sensors are widely used not only in medicine, but also in other areas such as security, 
food safety, and drug discovery as they allow detection of specific processes (biological, 


chemical, or physical) or molecules, for example protein and micro RNA biomarkers?? 


or gene 
mutation’. Novel sensors that can improve medical care, contribute insights into disease 
processes or assist with early detection are desirable. The pioneering work in developing 
sensors belongs to Clark who in 1956 developed an oxygen electrode for measuring oxygen 
levels in blood that was later named the “Clark electrode". This remarkable electrode was 
initially used in the heart-lung machine that Leland C. Clark Jr. also pioneered. A few years 
later, in 1962, the first prototype of a biosensor, a glucose biosensor, was introduced by the 


same inventor 604. What differentiates a biosensor from other sensors is that it uses a molecule 
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derived from а biological origin, often referred to as а bioreceptor, as ап integral part of the 
sensor function. 

Sensors measure biological or chemical reactions by generating signals proportional to 
the concentration of an analyte. Since Clark's glucose sensor, the field of biosensors started to 
rapidly develop, expanding both the number of devices and the scope of their application®. A 


urea-specific enzyme electrode was developed in 1969 by Guilbault and Montalvo Л, a 


fiber-optic biosensor for carbon dioxide and oxygen detection in 1975 by Lubbers and Optiz®’, 
and several commercial biosensors have been developed% 61! The potential applications of 
biosensors have been extensively researched both in vitro and in vivo. Implantable biosensors, 
for example, allow continuous measurements in living organisms. 

A medical sensing device must have components that allow it to achieve recognition, 
signalization, and quantification. Those components are a sensor, a transducer, and a system to 


display or record the outcome of the test‘!? 


. High performance biosensors must have specific 
characteristics. The most important characteristic of any type of biosensor is selectivity, which 
enables detection of a specific analyte in a sample. Once detection occurs, the response needs 
to be converted to a signal. The ability of a sensor to provide consistent responses when a 
sample is measured several times is known as reproducibility. This is the second most 
important characteristic. An ideal sensor provides a stable signal over a range of ambient 
conditions. Whilst every useful sensor has to be selective, consistent and stable, another 
important factor is sensitivity, as this defines the limit of detection. However, in order to 
contribute to disease diagnostics, new analytical methods that are high in efficiency and can 
use small sample volumes need to be developed. 

Depending on the method of signal transduction, biosensors can be grouped as optical, 
electrochemical, mechanical, acoustic, or magnetic (piezoelectric). Some biomolecules are 
difficult to detect so are studied via an attached label (antibody, fluorophore, or radioactive 
isotope) that does not affect the behavior of the labeled analyte. This was relevant to the study 
as two fluorophores (donor and acceptor) conjugated to antibodies were used in developing a 
FRET-based biosensor assay. While a variety of labels can be used, the labeling strategy must 
be carefully considered based on the intended application as some labels might block binding 
sites. Based on their method of detection, biosensors can be divided into two large 
groups: label-based and label-free??. Label-based biosensors have fluorescent, colorimetric, 
or luminescent signal labels that can provide specificity to the biosensors. This approach has 
been taken to detect interaction initially between a pair of general tetraspanin EV markers 


(CD63 and CD81) and then pairs of proposed biomarkers. In contrast, label-free biosensors 
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directly generate the signal by interaction with the analyte of interest*'*. Optical biosensors are 
the most common type of biosensor and have been developed in both label-based and label- 
free formats. 

There are several fluorescence-based techniques that include a fluorescent reporter that 
is necessary to detect the target analyte. Fluorescence-based biosensors can take many formats, 
and can use technologies that include sandwich immunoassays, immuno-magnetic beads, and 
FRET®°!5, Fluorescent labeling of proposed biomarkers with a pair of fluorophore dyes сап be 


employed to examine interactions through FRET, as has been shown in this study. 


4.1.2 Principles of FRET and its application in biosensors 


Identification of biomarkers can contribute to diagnosis and thus improve potential for 
therapeutic intervention and so improve outcomes. Ideally, biomarkers would enable early 
detection of disease. For decades, researchers have been working on the discovery of 
biomarkers that recognize a specific disease, and through application in biosensors may have 
sufficient sensitivity and selectivity to detect low signal abundance in small volumes of 
samples of biological origin. For a long time, researchers were curious as to whether it was 
possible to interpret and visualize events happening in living organisms on a molecular level. 
FRET-based biosensors have been perceived as an excellent and powerful tool, recognizing 
and defining the concentration of a target molecule. For sensing purposes, FRET-based 
biosensors have been extensively used in many biomedical applications including bioimaging 


and drug screening®!® 


. My rationale behind choosing FRET-based biosensing methods was that 
they are relatively simple and provide rapid and sensitive diagnostics, even down to single-cell 
level, and work in heterogeneous samples, such as ехоѕотеѕ®!”. 

As has been previously mentioned, FRET-based biosensors require two fluorescent 
probes: a donor and an acceptor. FRET operates through a distance-dependent transfer of 
energy. Briefly, the FRET process begins with excitation of the donor containing a fluorophore 
dye that initially absorbs the energy that then transfers it to an acceptor containing another 
fluorophore dye located in close proximity (typically less than 10 nm away)! arising from 
spectral overlap of the donor's emission and the acceptor's absorption spectra. Due to this 
process, the fluorescence intensity of the donor decreases, while the acceptor's emission 


intensity increases. Another requirement for FRET to occur is that there has to be a dipole- 


dipole interaction between the acceptor and donor®!, 
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Although ЕКЕТ is usually used to study intermolecular interactions ог conformational 
changes within a molecule, it can be also applied to biological assays. The key advantage of 
using FRET-based assays over biochemical assays is to achieve better subcellular resolution in 
a living сеП620, The efficiency of a FRET-based assay can be optimized by choosing ideal 
FRET pairs (donor and acceptor fluorophore dyes) that are essential for the high performance 
of the assay. 

This study introduces a FRET-based assay that was developed to detect the interaction 
between the proposed putative lung cancer biomarkers on an exosome surface with an ultimate 
goal to detect cancer using markers on exosomes. The assay was made by using a pair of 
antibodies corresponding to the proposed biomarkers, one labeled with a donor fluorophore 
and another with an acceptor fluorophore. These labeled fluorophores are incubated with the 
exosomes in solution. FRET efficiency is а distance-dependent matter for this method of 
detection. Binding of fluorophore-labeled antibodies in close proximity will occur only when 
the selected biomarker pair is co-located on an exosome surface and is in sufficient abundance 
to bring a high number of biomarker pairs within the distance limits of FRET. Based on the 
separation distance, the FRET efficiency can either decrease or increase. Two main methods 
of measuring the FRET efficiency are spectral methods™! and photoluminescence (PL) lifetime 


measurements412,622,623 


. In this study, PL lifetime measurement by time-correlated single 
photon counting (TCSPC) was chosen because neither the concentration of the fluorophore 
dyes nor light scattering in the sample affect measurements. The PL lifetime refers to the 
average time that a fluorophore spends in an excited state before returning to the ground state. 
By measuring the PL lifetime, changes in the fluorophore’s surroundings that impact lifetime 
can be observed and quantified. The presence of an appropriate acceptor fluorophore allowing 
for FRET provides a new pathway for the donor fluorophore to return to the ground state, 


resulting in a reduced PL lifetime. The process of energy transfer is shown on a simplified 


Jablonski diagram below (Fig. 52). 
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Figure 52. The Jablonski energy transfer diagram. The simplified diagram is showing energy transition including excitation 
of the ground state (Sy) and mechanisms of decay of excited states (81) to return to the ground state". 


The energy transfer process from a molecule in the excited state begins when the 
molecule from the ground state (So) gets excited after absorbing energy from a photon. From 
the excited state, a molecule returns to the ground state in one of three forms: emission of a 
photon, non-radiative decay, or ЕКЕТ. During the process when electrons go from excited 
states to the ground state, a loss of vibrational energy occurs. As a result of that, the emission 
spectrum becomes shifted to longer wavelengths than the excitation spectrum. This process of 
energy difference between absorption and emission spectra of a molecule was investigated by 
Sir George Gabriel Stokes who studied the properties of fluorescence and first observed this 
phenomenon in 1852, which was later named after him as the Stokes shift. During his 
observation, Stokes noticed that the wavelength of the exciting light was always lower than the 
wavelength of emitted fluorescent light. Later, the differentiated transition moments between 
absorption and emission were explained by Alexander Jablonski in an energy diagram that was 
then named after him as the “Jablonski energy diagram" (Fig. 52) and used to explain several 
processes including delayed fluorescence 627, 

As previously mentioned, the close proximity of the pairs is the most critical element for 
FRET. In this study, we tested whether pairs of proteins that are abundant in exosomes would 
be within the separation threshold required to give FRET using a simple fluorophore- 
conjugated antibody system. That was initially validated for general tetraspanin EV markers 
CD63 and CD81. If these two proteins are abundantly co-located оп ап exosome, we should 
expect the donor and acceptors fluorophores to be in close proximity and as a result observe 


reduction in PL lifetime, the FRET effect. However, a few more conditions have to be met. 
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The absorption spectrum of the acceptor must overlap the fluorescence emission of the donor 


(Fig. 53 A). This degree is known as the spectral overlap integral (J). 
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Figure 53. Schematic representation of ЕКЕТ. The figure is showing (A) the spectral overlap integral апа (B) fluorescence 
lifetime decay showing distance and energy transfer efficiency. Adapted from Olympus-lifescience.com 

Further, the dipole-dipole interaction between donor and acceptor resulting in energy 
transition expects dipoles to be aligned in parallel orientation. Finally, there are two key 
distances: the separation of the acceptor and donor fluorophores (ғ) and the Förster distance 
(Ro) of the donor fluorophore. The Forster distance is the distance at which energy transfer is 
50% efficient (Fig. 53 B). That means that 50% of excited donor energy is deactivated by 
FRET. The rate at which the non-radiative transfer of energy between a donor and an acceptor 


fluorophore occurs is known as kr and is defined in equation 2 (eq 2): 


kr= (1/хь) *[(Ro /r)°] (eq 2), 


where 7p is the fluorescence lifetime of the donor in the absence of the acceptor, Ro is the 
Forster radius, and r is the distance between the donor and acceptor fluorophores. Assuming 
that a pair is compatible with the previously stated requirements, the efficiency (Е) can be 


calculated using equation 3 (eq 3): 
E= В (5 R) (eq 3). 
The magnitude of the Förster radius (Ro) is highly dependent on the spectral properties of 
the donor and acceptor fluorophores and their mutual orientation. 


Fórster demonstrated that the most useful distance between donor and acceptor is 


between 2 and 9 nm and with the R,values around 5 nm for organic fluorophores. In practice, 
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the distance can vary somewhere between 4-7 nm due to the background поівеб28, Besides the 
already mentioned factors, there are a number of other important features, such as the 
fluorescence quantum yield of the donor in the absence of an acceptor (Фу), the refractive index 
of the solution (n), the orientation factor of each molecule (&?), and by the already mentioned 
spectral overlap integral (J). The relationship between these factors and the Förster distance 
(Ro) is shown below (eq 4): 


Ro=9.78 x 109 (n^* Фа *k?*7)" 5 А (eq 4). 


In practice, there are several limitations that can affect FRET measurements. The first 
limitation is associated with the different brightness level of fluorophores when imaged 
together, forcing one channel to become a dominant. This challenge can be avoided if the 
brightness level of selected dyes is relatively similar. The second limitation is a donor-acceptor 
stoichiometry that can interfere with detection if pairs lie outside of defined ratios 1:10 or 10:1. 
Excess concentration of one or the other will result in an absence of FRET even though the 
pair might meet prior requirements, such as the Fórster distance. The next challenge that can 
be easily reduced by choosing a well-separated donor and acceptor is crossover or bleed- 
through between spectral overlap of selected pairs. An advantage of analyzing FRET using PL 
lifetime measurement of the donor is that it overcomes this problem. Finally, the inappropriate 
alignment of selected pairs, when the Fórster radius between fluorophores is greater than 6 nm, 
can lead to a false negative or undetectable signal. That can often happen in protein-protein 


interactions if fluorophores are attached to the opposite sides of the protein-protein complex”. 


4.1.3. FRET, exosomes, and cancer diagnostics 


The range of biological applications of FRET has exploded over the past few years 
leading to an improved understanding of dynamic interactions of proteins that have significant 
impacts on signaling pathways. Current techniques, such as fluorescence imaging or electronic 
microscopy, were unable to provide reliable measurements due to limitations in spatial 
resolution or lack of labeling techniques®°. On the other hand, the application of intermolecular 
FRET had overcome those challenges. Generally, FRET application can be often seen in the 
characterization of membrane systems through protein-protein or lipid-protein interactions. It 
is a relatively simple and very powerful approach to study membrane biophysics®!. As 


described in section 4.1.2, FRET can be determined by differences in photoluminescence, 
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intensity (steady state), ог Бу detection of the fluorophore lifetime (time-resolved) that was 
used in this study. Time-resolved FRET measurements have one main advantage over steady 
state intensity measurements: the measurement is independent of the concentration of the 
fluorophore dyes. This is important in many biological applications, such as bioimaging. 
Although not used in this study, a brief description of microscopy imaging analysis is included 
for comparative purposes. 

Fluorescence microscopy is considered a fundamental unit of biomedical imaging due to 
its high sensitivity to the molecular environment and the ability to recognize changes in 
molecular confirmations. As a powerful technique, Fluorescent Lifetime Imaging Microscopy 
(FLIM) can provide a high-resolution analysis of changes occurring in cells on the molecular 
level by introducing fluorophores into molecular environments. FLIM can be used to monitor 
disease progression and drug efficacy reflecting insights into various processes in cells and 
tissues. Similar to FRET, FLIM exploits the lifetime property of fluorescence. However, FLIM 
offers several advantages over FRET. For example, the ability to visualize changes of 
fluorophore behavior in molecular environments within the proximity of FRET pairs that 
cannot be achieved only based on intensity теаѕшгетепіѕ6!8630, Another advantage is being 
independent of fluorophore concentration and detected intensity considered as “a self- 
referenced measurement”. That being said, there are several drawbacks that prevent FLIM 
taking a dominant approach over FRET. First, imaging is much more time consuming and 
requires more complicated high-cost equipment. Secondly, imaging in live cells can be 
challenging due to autofluorescence of cells and pH level630. 

Alternatively, in some cases the combination of FLIM-FRET could be a better option. A 
FLIM-FRET combination only uses the lifetime measurements of the donor. This use of only 
the lifetime donor mean that while some pairs are not effective in FRET experiments due to 
their spectral overlap issues, these can be used in FLIM-FRET™?. 

FRET-based applications gain their popularity due to the implementation of FRET 
techniques into molecular diagnostics. This interest was based on the ability to monitor 
molecular dynamics of cell membranes and other molecular interactions®**. Further, FRET- 
based biosensors and assays were utilized not only to study cellular dynamics but also applied 
on a single-cell level. Cancer biomarkers, such as proteins, DNA and micro RNA, can be used 
in ensemble with a FRET-based platform for monitor progression of diseases or their early 
diagnosis??. The advantage of FLIM was used by Wong's research group in identifying the 
status of dimerization of human epidermal growth factor receptors on circulating EVs in breast 


cancer?"^, Briefly, chosen receptors EFGR and HER3 were labeled with a fluorescently 
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conjugated primary antibodies pair of FRET fluorophores (Alexa 546 F4 (EGFR) and Cy5 
2F12 (HER3)). Then the lifetime of the donor was measured, and FRET was calculated based 
on the donor's lifetime with and without the acceptor (eq 3). Then lifetime images before and 
after treatment were compared tracking the changes related to the separation of chosen 
receptors in residual tumors. 

FRET was successfully applied to monitor protein synthesis in single live cancer cells as 
an aid to visualizing the production of proteins generated from fluorescently labeled tRNAs95. 
The selected cell line (multiple myeloma cells U266) was transfected with fluorescently labeled 
tRNAs that were used by ribosomes for protein translation. The study concluded that 
monitoring protein synthesis by FRET has a higher efficiency and sensitivity and can be 
mapped in real time compared to other previous successfully used methods, such as isotopic 
labeling of amino acids or non-radioactive surface sensing of translation assay based on flow 
cytometry principles. In another example a FRET-based immunoassay demonstrated the ability 
to distinguish between SCLC апа NSCLC lung carcinoma?*, Implementation of other types 
of cancer biomarkers, such us micro RNA, was proposed by using amplified time-gated FRET 
that can advance the possibilities of micro RNA-based cancer diagnostics and research??$?7, 

As mentioned in 4.1.1, FRET-based sensors can be used in living organisms. An example 
of that is shown by Nobis's group that engineered a RhoA FRET biosensor mouse to monitor 
RhoA activity in normal tissue homeostasis and disease progression in-vivo™®. In that study, 
the FRET biosensor was utilized to investigate the activity of the small GTPase molecule, 
RhoA, and access changes associated with different stages of mammary gestation. Further, 
monitoring changes in RhoA activity in the same mouse contributed to understanding the 
efficacy of drug-targeting in breast and pancreatic cancer. This study showed that FRET-based 
biosensors allow investigation of changes in the crosstalk between molecules of interest during 
disease progression. This offers potential to optimize drug targeting. Another example is 
developing a FRET-based molecular probe to monitor enzyme activity in breast cancer which 
allowed the researchers to detect tumors in vivo against a background of normal tissue”. 

Besides FRET/FLIM application in diagnostics, it has been also demonstrated as a 
preclinical tool to access drug delivery and efficacy in live tumors% and to experimentally 
validate a list of potential genes implicated in cancer, predicted by the gene prioritizing tool 
MaxLink™!. In addition, FRET/FLIM was used for phenotyping analysis of EVs that could 


improve the efficacy of the general principle EV-based diagnosis? 


. There is also a published 
use of FRET/FLIM to determine the efficiency of engineering the lipid composition of 


exosomes??, FRET pairs (donor-acceptor), i.e., ORP3-TRITC/VAP-A-GFP, Rab7- 
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RFP/ORP3-FITC, and Rab7-RFP/V AP-A-GFP, were used to determine potential interactions 
between ORP3, VAP proteins, and Rab7 in the biogenesis and dynamics of NEI-associated 
late endosomes. A recent study that reported using FRET with exosomes described the 
invention of a FRET magnetic aptamer-sensor that allows rapid and efficient detection of 
epithelial cancer achieved by magnetic enrichment of exosomes with aptamer functionalized 
fluorescent superparamagnetic microparticles assembled with gold nanoparticles conjugated with 
complementary DNA. The strategy of exosome detection was based on the FRET pair effect between 
quantum dot functionalized aptamer particles and gold nanoparticles with complementary DNA $. 

Quantitative FRET imaging was applied to visualize the pro-tumorigenic and pro- 
metastatic membrane-type 1 invasiveness of live cancer cells that can aid screening of effective 
anti-cancer inhibitors?^, Furthermore, FLIM has routinely been used in living cells for 
measuring of signaling events, such as protein-protein interactions5646, 

In summary, FRET-based sensing has been widely used in biomedical applications 
including diagnostics. The FRET-based assay operates through a distance-dependent transfer 
of energy between a donor and acceptor. Implementation of FRET techniques into molecular 
diagnostics based on antibody-antigen interaction can be utilized for early detection of various 
disease including lung cancer. In addition, the efficiency of a FRET-based assay can be 


optimized by choosing ideal FRET pairs (donor and acceptor fluorophore dyes). 


4.2 METHODS 
4.2.1 Preparation of AlexaFluor 488 (AF488)/AlexaFluor 546 (AF546)-proposed 


biomarkers conjugates 


The labeling of proposed biomarkers with fluorophore dye, either AF488 (44,7488 nm; 
Àem-520 nm) or AF546 (44-556 nm; Aem=573 nm) N-hydroxysuccinimide ester 
(ThermoFisher), followed a previously published protocol?" Briefly, 1 mg of AF488 was 
diluted іп 100 uL dimethyl sulfoxide (DMSO) and used immediately. Antibodies (CD151, 
GLUT, МҮОЕ, RhoC) used in the conjugation (identical to those described in 3.2.10) were 
diluted in freshly made 0.1 M sodium bicarbonate buffer, pH 8.3 in ratio 1:10 at the final 
protein concentration of 5 mg/mL and of 20 uL volume. To this solution, 1 uL of the reactive 
dye solution (AF488-NHS ester) was added and incubated overnight in dark place at 
4 °C under gentle shaking. After the incubation, unbound dye was removed by buffer exchange 


using protein desalting spin columns (ThermoFisher; 700 uL of resin іп 10 mM Tris, pH 7.5 
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with 0.0296 sodium azide). First, the packing buffer was removed by centrifuging the column 
for 1 min in a swinging bucket centrifuge at 1,000xg. Then, the column was equilibrated with 
400 uL exchange buffer (1хРВЅ (phosphate buffered saline with a final concentration of 0.01 
M (composition NaCl -137 mM, KCl- 2.7 mM, Na2HPOs- 10 mM)) pH7.4 with 0.9% sodium 
azide). After loading the sample, the column was centrifuged for 5 min at 1,000xg and the 


purified sample was collected and stored at 4 ?C. 


4.2.2 Determination of Degree of Labeling (DOL) 


After the process of labeling antibodies was complete, the unbound dye was removed 
through gel filtration using buffer exchange columns. Once all excess of unbound dye was 
removed, the degree of labeling was determined following several steps. First, the absorbance 
of the purified antibody-dye conjugate from 4.2.1 or 4.2.2 was measured at 280 nm to estimate 
the protein concentration. The molar concentration of the antibody (Сув, м) in the sample was 


calculated using equation 5 (eq 5): 


Ciuc. m= ((А-- Aaye (Adye)CF)/€iecL)DF (eq 5), 


where A»so is the measured absorbance of the sample, Aaye (Aaye) is the measured absorbance of 
the sample at the identified wavelength of maximum absorbance of the labeling fluorophore, 
CF is the ratio for a solution of the fluorophore alone of Aay«(280)/A4,. (Aaye), Еке is the molar 
absorption coefficient of the antibody at 280 nm (equal to 203,000 M'cm"), L is the optical 
path length, and DF is the dilution factor applied to the labeled antibody in determining the 
absorbance values. Similarly, the molar concentration for the conjugated dye in the labeled 


sample (См) was calculated by using equation 6 (eq 6): 


Caye, м- (Aaye(Adye)/EdyeL)DF (eq 6). 


The degree of labeling (DOL) was determined from equation 7 (eq 7): 


DOL = Сақ м/Съсм (eq 7). 


In this study, extinction coefficients (g) of 71000 M^cm" with CF 0.11 for AF488 
and 104000 Mcm with СЕ 0.12 for AF546 were used 6%. 
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4.2.3 Exosome conjugation with labeled antibodies 


The antibodies against CD81, С 151, GLUT, and RhoC were conjugated with the 
AF488 donor fluorophore and antibodies against CD63, CD151, GLUT, and MYOF were 
conjugated with the AF546 acceptor fluorophore. Four controls chosen: 1) both fluorophore- 
labeled antibodies (AF488/546) in the absence of exosomes; 2) exosomes conjugated with 
labeled a donor-labeled antibody with free AF546 dye; 3) exosomes in the presence of both 
free dyes AF488/546; and 4) exosomes conjugated with only the donor-labeled antibody. 
Exosomes conjugated with both labeled antibodies were considered as the ‘FRET sample’. 
Cell- and breath-derived exosomes were isolated by ultracentrifugation as described above in 
2.4.3 and re-suspended in 20 uL of fresh 1xPBS. Then 1 uL of each of AF488- and AF546- 
labeled antibodies were added to all required FRET controls and the sample established above 
and incubated for 2 h at RT in the dark under gentle shaking. Samples volumes were adjusted 
to 0.5 mL with 1XPBS and passed through a size exclusion column (qEVoriginal column, Izon 
Science) to remove unbound fluorophore conjugates as previously described in 2.4.4. 
Samples were stored in the dark at 4 °C until required. Free dye required for controls 2 and 3 
was used at the same concentration as measured in the labeled antibody 
samples. Determination of dye concentration in the FRET sample was done through 
absorbance measurements for both dyes (AF488 and AF546) as described in 4.2.4 using a 


Cary 100 ultraviolet-visible spectrophotometer. 


4.2.4 Determination of fluorescent dye concentration in the FRET sample 


Measurements were taken at absorbance wavelength (Acx) of 494 nm and emission 
measured at a wavelength (Aem) of 517 nm for AF488 апа with absorbance wavelength (Acx) 
of 556 nm and emission measured at a wavelength (Aem) of 573 nm for AF546at 488 nm using 
a Cary 100 ultraviolet-visible spectrophotometer. Then, the concentration was calculated 


using the Beer-Lambert Law equation. 


4.2.5 Time-resolved studies and FRET measurements 


A labeled exosome sample of total volume 22 uL was transferred into a UV-transparent 


cuvette (3.5 mL, light path 10 mmz 0.08 mm, wavelength range 190 nm-2.5 um, ThorLabs) 


for PL measurements and diluted up to 2 mL with 1xPBS. The PL lifetimes of the sample and 
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controls were determined with a F980 Fluorescence Spectrometer (Edinburgh Photonics). PL 
decay scans were collected at a time scale of 500 ns over 1024 channels (0.4883 ns time/ch), 
10,000 peak counts with the 1.5 ms (microseconds) pulse period, with excitation wavelength 
(Хх) of 380 nm and emission measured at a wavelength (Aem) of 520 nm. The Forster distance 
value of 6.4 A was taken from the literature. The instrument response function (IRF) was 
measured using fresh milk to scatter excitation light. The raw data was collected and averaged 
from three biological samples and repeated twice for each condition. The ONE all-inclusive 
F980 software packaging was used to analyze the fluorescent lifetime data and to undertake 
decay fitting to approximate the IRF. Tail fitting was based on the Marquardt-Levenberg 
algorithm using a two exponential decay fit, which in almost all cases provided a higher quality 
fit to experimental data that a single exponential fit in these complex samples. The quality of 
fit was determined through chi-squared (у?) and residual plots. The average lifetime (Tave) of 
the donor in the FRET test sample and controls was obtained by TCSPC using equation 8 
(eq 8): 


Таш = (OUT? oot? )/ (out от.) (eq 8), 


where tis the lifetime and о is the pre-exponential factor for each of the two exponential 
functions. The efficiency of energy transfer for different acceptors and donors was calculated 


using the donor in the presence and absence of the acceptor using equation 9 (eq 9): 
EF 1-Tavgpa/Tavgo (eq 9), 


where Е is the FRET efficiency, Tavg pa is the PL lifetime of a donor-labeled antibody that is 
co-conjugated onto an exosome sample with an acceptor-labeled antibody (i.e. from the “FRET 
sample"), and tavgp is the PL lifetime of ап donor-labeled antibody conjugated onto an 
exosomes sample in the presence of acceptor free in solution (i.e., from control 2). In addition, 
the distance of separation between donor and acceptor (r) was quantified using Förster distance 


(Ro, 6.4 Å) in the equation 10 (eq 10): 


r= SQUE) - 1] * Ro (eq 10). 
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4.2.6 Statistical analysis 


Analyses were performed using Prism GraphPad scientific 2D graphing and statistical 
software, version 9.1.1 (223) for macOS, April 16, 2021, San Diego, California. The six 
replicates’ data from FRET samples and controls were tested for normality of distribution by 
constructing a Quantile-Quantile (Q-Q) graphical scatterplot using the Normality Tests 
diagnostic function. To confirm the statistical difference between FRET samples and controls, 
a two-tailed t-test was applied with a confidence level set to 95% and based on statistical 


significance (p-value <0.05). 


4.3 RESULTS 


This section describes the results from several steps. In order to collect accurate and 
sufficient data from the experiment, several steps were followed. First, the antibodies were 
labeled with fluorophore dyes to later take part in the FRET-based assay. Second, the degree 
of labeling was measured to ensure the quality of the assay. For that, the concentration of the 
dyes іп the FRET samples was defined. Third, selected FRET pairs were tested first on 4549 
cell-derived exosomes to select the most efficient pair/pairs. Finally, the pair/pairs showing 
greatest percentage of efficiency were further tested on negative controls: WI-38 cell-derived 


exosomes and breath-derived exosomes. The results of each step are reported below. 


4.3.1 Labeling antibodies and determination of degree of labeling 


Antibodies were labeled as described in 4.2.1, and the degree of labeling was determined 
as described in 4.2.2. Because the presence of unlabeled or multiple-labeled antibodies can 
alter the FRET signal, determination of this parameter was critical for FRET experiments to 
ensure reproducibility and quality control of fluorescence intensity of fluorophore-labeled 


bioconjugates. The results of degree of labeling are shown in Table 11. 


Table 13. Degree of labeling after labeling antibodies with selected fluorophore pairs AF488/AF 546. 


Donor (AF488) + proposed biomarker Donor Acceptor (AF546) + proposed biomarker Acceptor 


CD81 8.642 CD63 5.894 
RHOC 8.21 CD151 5.43 
GLUT 8.34 MYOF 5.75. 
С0151 7.98 МҮОЕ 4.54 
С0151 8.28 GLUT 5.97 
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In general, the ideal degree of labeling for both donor and acceptor labeled antibodies is 
considered within the acceptable range of 2 to 10 (dye molecules per protein)’. That suggests 
that results reported in Table 13 are within the acceptable levels of labeling. 

For reproducibility of the experiment, it was necessary to measure the apparent dye 
concentration in the dye-labeled antibody solutions to get the dye concentrations in the controls 
and the sample to match. That was done to keep data consistency. The absorbance spectra for 
the dye component in cell-derived exosomes that was conjugated to antibodies is shown on 


Figure 54. 
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Figure 54.The absorbance spectra for the dye component in cell-derived exosomes conjugated to antibodies. On the figure, 
A549 exosomes are represented by orange line and WI-38 exosomes are shown in green. The baseline is represented by the 
blue line, while blank is by the red line. The Y-axis represents absorbance values, and X-axis represents wavelength (nm). The 
figure was generated by using Prism GraphPad Software (version 9.1.1) for macOS, San Diego, California. 


The concentration of the dye components of the fluorescently labeled antibodies are 
shown in Table 14. 
Table 14. Absorbance values of dyes and their concentration in the sample. The concentration of dve component in exosomes 


conjugated with fluorescent labeled antibodies was calculated based on dyes’ absorbance values and the extinction 
coefficients were taken from the literature’. 


Sample Dye Absorbance values Extinction coefficients (liter/mol-cm) Concentration (uM) 
A549 AF488 0.097137 71000 1.368 
А549 АЕ546 0.022299 104000 0.214 
УУІ-38 AF488 0.102822 71000 1.448 
WI-38 AF546 0.032806 104000 0.31544 


After antibodies were labeled and the degree of labeling was determined, the next step 


was testing FRET pairs on A549 cell-derived exosomes. 
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4.3.2 Determination of the ЕКЕТ efficiency оп general tetraspanin EV markers 
(preliminary study) 


Although widely used in molecular diagnostics, FRET-based assays have not been 
previously applied to confirm the presence of exosomes by using protein biomarkers. In this 
study, two fluorophore dyes AF488 (donor) and AF546 (acceptor) were chosen as the donor 
and acceptor respectively, as these had been successfully used in FRET experiments. First, 
the FRET assay was tested with labeled (the chosen dyes were attached to an antibody) 
antibodies corresponding general tetraspanin EV markers (CD63 and CD81) which are 
embedded in the surface of exosomes. The key, sample, and controls selected to assess the 


potential for non-specific interactions chosen for the experiment are presented on Figure 55. 


LEGEND KEY: SAMPLE AND CONTROLS: 
EXOSOME FRET- 
SAMPLE 


CONTROL 1 
DONOR Ж + A 


Ab conjugated to AF488 

* 
ACCEPTOR D + ak CONTROL 2 
Ab conjugated to AF546 


* 
+ ES CONTROL 3 
FREE DYE AF488 
жу ж 
FREE DYE AF546 + Ж CONTROL 4 


жж * > ©) 


Figure 55. Legend key for the initial FRET experiment. Controls 2-4 marked by an asterisk were later 
modified. 


The results of the FRET experiment based on the data of six replicates for each sample 
are reported in Figure 56, where panel А represents all controls previously defined in Figure 
55, and panels B-D show the FRET outcome for cell- and breath-derived exosomes compared 


to the controls. 
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Figure 56. PL lifetime analysis of cell- and breath-derived exosomes labeled with general tetraspanin antibodies. PL 
lifetimes obtained by TCSPC with excitation wavelength (dex) of 380 пт апа emission measured at a wavelength (dem) of 
520 nm of (A) controls and (B,C) cell (A549, WI-38 respectively)- and (D) breath-derived exosomes conjugated with 
fluorescently labeled antibodies corresponding to the general tetraspanin markers (CD63 and CD81) showing four selected 
controls and FRET (donor-acceptor) samples. Antibodies against the selected biomarkers for these measurements were 
labeled as follows: anti-CD63 AF 488 - anti-CD81 AF 546. Controls on panels B-D are identical to those shown on panel А. 
The Y-axis represents normalized photon counts, and X-axis represents decay time (ns). The instrument response (IRF) is 
shown on each panel in black. The experiment was performed on three biological samples and run in duplicate. The error 
bars represent standard deviation. Some error bars are not visible as the data in the duplicates were close to identical. The 
figure was generated by using Prism GraphPad Software (version 9.1.1) for macOS, San Diego, California. 


Based on experimental data, a significant decrease was observed in the PL lifetime of the 
donor when the dye-conjugates were attached to antibodies against the general tetraspanin EV 
markers CD63 and CD81 and incubated with any of A549 cell - (43.8% efficiency), WI-38 cell 
- (44.3% efficiency), or breath-derived exosomes (55.9% efficiency) (Table 15). 


Table 15. PL lifetime analysis of cell- and breath-derived exosomes labeled with fluorophore-conjugated general 
tetraspanin antibodies. Emission decay analysis of a pair of antibodies corresponding to general tetraspanin markers (CD63 
and CD81) on cell- and breath-derived exosomes. The data was collected from three biological samples run in duplicate. 


a 


Source of exosome Tavg (ns) Tavg €ntl (ns) E (%) r (nm) p 
А549 2.02+0.010 3.59+0.005 43.8 7.34 7.7х10722 
УУІ-38 2.02+0.001 3.63+0.003 44.3 7.35 6.0x107?^ 
Breath 1.590.003 3.600.004 55.9 7.13 14x10? 


Tayg— PL lifetime of FRET pairs, shown + standard deviation; тау cntl— PL lifetime of control, PL lifetime of the exosomes 
conjugated with a AF488 labeled antibody in the presence of free AF546 dye control, shown + standard deviation; E (%) 
FRET efficiency; r — distance between the donor and acceptor fluorophores 


а p-values from unpaired, two-tailed t-test comparing PL lifetime of sample to control 
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As expected, given this experiment was using general tetraspanin EV markers, the 
lifetime decay for both cell-derived exosomes is almost identical, which resulted in a very 
similar FRET efficiency. Lifetime decay for breath exosomes was slightly faster than in cell- 
derived exosomes reporting higher efficiency. At that point a positive outcome observed in 
reduction in PL lifetimes was expected as these markers (CD63 and CD81) are both known to 
be abundant on exosomes. The ability of a FRET-based assay to distinguish between cancerous 
and non-tumorigenic exosomes was tested via the use of the proposed biomarkers (CD151, 


GLUT, MYOF and RhoC) and described in the next subchapter 4.3.3. 


4.3.3 Determination of the FRET efficiency of proposed biomarkers 


For this experiment, antibodies corresponding to proposed biomarkers were labeled with 
the donors (CD151, GLUT, and RHOC) and the acceptors (CD151, GLUT, and MYOF) 
creating four pairs: RHOC-CD151, GLUT-MYOF, CD151-MYOF, and CD151-GLUT (Fig. 
57). 


LEGEND KEY: SAMPLE AND CONTROLS: 
EXOSOME FRET- 
SAMPLE 


DONOR А + A CONTROL 4 


Ab conjugated to AF488 


ACCEPTOR + CONTROL 2 
Ab conjugated to AF546 


FREE DYE AF488 © + š CONTROL з 


FREE DYE AF546 
CONTROL 4 


Figure 57. Various controls and FRET samples for the FRET-based detection assay. 


еж EO 


Among these four pairs, separate donor-labeled and acceptor-labeled samples of CD151 
and GLUT were prepared. This decision was determined by the choice to test two of the 
markers as both donor and acceptor and opted to use the strongest (GLUT) and weakest 
(CD151) bands on the blot in Figure 35 for this purpose. Controls were selected to assess the 
potential for non-specific interactions, as shown in Appendix, Figs. 5 1-10. 

First, PL lifetime measurements of dyes and conjugated dyes were collected. There were 
no change between the isolated donor dyes’ photoluminescence (PL) and that of the conjugated 
complexes (Fig. S 11). Then, PL lifetime measurements were carried out with A549 cell- 


derived exosomes with pairs of donor- and acceptor-labeled antibodies (Fig. 58). 
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Figure 58. PL lifetime analysis of A549 cell-derived exosomes labeled with proposed biomarkers. PL lifetimes obtained by 
time-correlated single photon counting (TCSPC) (¿,,=380 пт; hem=520 nm) of A549 cell-derived exosomes conjugated with 
fluorescently labeled antibodies corresponding to the proposed biomarkers in combination of (A) CD151-MYOF, (B) RhoC- 
Ср151, (С) GLUT-MYOF, апа (D) CD151-GLUT respectively showing four selected controls and FRET (donor-acceptor) 
samples. Antibodies against the selected biomarkers for these measurements were labeled as follows: CD151 AF488-MYOF 
AF546, RhoC AF488-CD151 AF 546, GLUT AF488-MYOF AF 546, and CD151 AF488-GLUT AF 546. The Y-axis represents 
normalized photon counts, and X-axis represents decay time (ns). The instrument response (IRF) is shown on each panel in 
black (dashed). The experiment was performed on three biological samples and run in duplicate. The error bars represent 
standard deviation. Some error bars are not visible as the data in the duplicates were close to identical. The figure was 
generated by using Prism GraphPad Software (version 9.1.1) for macOS, San Diego, California. 


The FRET efficiency was calculated for all proposed pairs. The derived data is in Table 16. 


Table 16. PL lifetime analysis of A549 cell-derived exosomes labeled with fluorophore-conjugated antibodies 
corresponding proposed biomarkers. Emission decay analysis of pairs of antibodies corresponding proposed biomarkers. 
The data was collected from three biological samples run in duplicate. 


FRET pairs Tavg (ns) Tayg €ntl (ns) E (99) r (nm) р 
CD151-MYOF 3.94+0.010 4.25+0.075 7.30 9.77 1.2х10* 
RhoC-CD151 3.40+0.027 4.29+0.027 20.8 8.00 2.2х10:15 
GLUT-MYOF 4.22+0.032 4.33+0.008 2.53 11.8 9.5х10% 
CDI5I-GLUT^ 4.29+0.043 4.02+0.039 МА МА 4.5x107 


Tave— PL lifetime of FRET pairs, shown + standard deviation; Tayg спі PL lifetime of control, PL lifetime of the exosomes 
conjugated with a AF488 labeled antibody in the presence of free AF546 dye control, shown + standard deviation; E (%) 
FRET efficiency; r — distance between the donor and acceptor fluorophores 


4 p-values from unpaired, two-tailed t-test comparing PL lifetime of sample to control 


^ The derived data for pair CD151-GLUT is not shown as the lifetime of the sample was longer than the controls. 
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The data in Table 16 confirms that the highest FRET efficiency occurred іп the CD151- 
MYOF and RhoC-CD151. Therefore, we decided to proceed only with these two pairs, CD151- 
MYOF and RhoC-CD151, that are significant and have the largest difference between the 
control and FRET-samples. No efficiency or spacing is calculated for the CD151-GLUT pair, 
given the increase lifetime in the FRET sample noted above. 

Then, the six replicates’ data were tested for normality of distribution by constructing a 
Quantile-Quantile (Q-Q plot) (Fig. 59). The data in the scatterplot lies along the diagonal, 


confirming that the data is normally distributed. 
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Figure 59. Normal О-О (Quantile-Quantile) scatterplots. The О-О scatterplots are showing Control and FRET sets of 
quantiles to confirm the normal distribution. The Y-axis represents theoretical quantiles, and X-axis represents sample 
quantiles (п-б). The figure was generated by using Prism GraphPad Software (version 9.1.1) for macOS, San Diego, 
California. 


After verifying that data from these experiments was normally distributed, controls and 
FRET-sample pairs were compared using the t-test. T-test was performed to identify whether 


the fluorescence lifetimes in the samples were significantly different from the controls (Fig. 
60). 
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Figure 60. T-test statistical analysis of FRET pairs on A549 cell-derived exosomes. Statistical representation of the two- 
tailed t-test for all proposed FRET pairs: (A) CD151-MYOF, (B) RHOC-CD151, (C) GLUT-MYOF, and (D) CD151-GLUT 
(n=6) when tested on exosome isolated from A549 cells. The p-value for all pairs was equal (A) or below 0.0001 showing 
very significant results. The Y-axis represents the decay time (ns). The figure was generated by using Prism GraphPad 
Sofiware (version 9.1.1) for macOS, San Diego, California. 
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From the results of the t-test, two pairs were excluded. Although pairs GLUT-MYOF (С) 
and CD151-GLUT (D) did show a statistically significant difference (р <0.0001), the absolute 
difference between samples and controls for the GLUT-MYOF pair was very small, which 
could hamper future application to a diagnostic assay. Unexpectedly, the data for the FRET 
sample with the CD151-GLUT pair showed a longer lifetime than the controls. 

The confirmation of hypothesis by evidence allowed moving to the final step of the 
experimental study: to confirm that the proposed FRET-based assay was able to distinguish 
between exosomes derived from a model of lung cancer (A549 cells) and exosomes derived 
from other sources (WI-38 cells and EBC of healthy volunteers) (Fig. 61). The FRET-samples 


and controls were chosen following the same combinations annotated in Figure 57. 
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Figure 61. PL lifetime analysis of WI-38 cell- and breath-derived exosomes labeled with proposed biomarkers. PL lifetimes 
obtained by time-correlated single photon counting (TCSPC) (dex =380 nm; dem=520 nm) of (A,B) WI-38 cell- and (С.Р) 
breath-derived exosomes conjugated with fluorescently labeled antibodies corresponding to the most efficient pairs of 
proposed biomarkers in combination of CD151-MYOF and RhoC-CD151 respectively showing four selected controls and 
FRET (donor-acceptor) samples. Antibodies against the selected biomarkers for these measurements were labeled as follows: 
(A.C) CD151 AF488-MYOF AF 546 апа (B, D) RhoC AF488-CD151 AF 546. The Y-axis represents normalized photon counts, 
and X-axis represents decay time (ns). The instrument response (IRF) is shown on each panel in black (dashed). The 
experiment was performed on three biological samples and run in duplicate. The error bars represent standard deviation. 
Some error bars are not visible as the data in the duplicates were close to identical. The figure was generated by using Prism 
GraphPad Software (version 9.1.1) for macOS, San Diego, California. 


The decay lifetimes of the FRET samples and controls for WI-38 cell- and breath-derived 


exosomes were very similar. The derived lifetime data of these pairs is shown in Table 17. 
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Table 17. PL lifetime analysis of successful FRET pairs on negative control samples. Emission decay analysis of the most 
efficient pairs of fluorophore-conjugated antibodies (RHOC-CD151 and CD151-MYOF) tested on negative controls (WI-38 
cell (annotated as W)- and breath (annotated as B)-derived exosomes). The data was collected from three biological samples 
run in duplicate. 


FRET pairs Tavg (ns) Тау cntl (ns) E (%)? r (nm)? p 
CD151-MYOF (W) 6.59--0.010 6.35+0.017 МА МА 41х10:4 
RhoC-CD151 (W) 6.92+0.025 6.81--0.008 NA NA 1.1х10% 
CD151-MYOF (В) 4.09+0.011 4.09+0.009 МА МА 0.3 
RhoC-CD151 (В) 3.86+0.024 3.63+0.007 МА МА 82х1019 


Тад- PL lifetime of FRET pairs, shown + standard deviation; Tavg cntl— PL lifetime of control, PL lifetime of the exosomes 
conjugated with a AF488 labeled antibody in the presence of free AF 546 dye control, shown + standard deviation; E (%) 
FRET efficiency; r — distance between the donor and acceptor fluorophores 


4 p-values from unpaired, two-tailed Т tests comparing PL lifetime of sample to control 


b The derived data for all pairs is not shown as the lifetime of the sample was longer/or equal than/to the controls. 


After verifying that data from these experiments was normally distributed, controls and 


FRET-sample pairs were compared using the t-test (Fig. 62). 
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Figure 62. T-test statistical analysis of FRET pairs on WI-38 cell- and breath-derived exosomes. Statistical representation 
of the two-tailed t-test (n=6) for two chosen FRET pairs, RHOC-CD151 and CD151-MYOF, tested on negative control 
samples (А,В) WI-38 cell- апа (С.Р) breath-derived exosomes. The p-value for all pairs, except CD151-MYOF for (D) breath- 
derived exosomes, was below 0.0001 showing extremely significant results. Pair CD151-MYOF has a non-significant p-value 
(p = 0.2674). The Y-axis represents the decay time (ns). The figure was generated by using Prism GraphPad Software (version 
9.1.1) for macOS, San Diego, California. 


Whilst three of the four samples showed statistically significant differences between the FRET 
sample and the control (р <0.0001), the magnitude of the differences for CD151-MYOF pair 
tested on breath-derived exosomes was relatively small (o = 0.2674), and with the slightly 
shorter lifetimes found in the controls — the opposite of what would be expected for FRET. 
Put together, these results showed that two of our proposed pairs of biomarkers can 
distinguish exosomes derived from a cell model of lung cancer and from exosomes derived 


from a model of normal lung tissue or from the breath of healthy volunteers. 
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4.4 DISCUSSION 


The research challenge in this chapter was associated with choosing suitable methods and 
techniques to test several pairs of biomarkers that were combinations of the putative 
biomarkers identified as a result of the proteomic analysis identified in Chapter 3. 

The clinical relevance of FRET-based assays has been shown in oncology, protein- 
protein interaction, micro RNAs, diagnosis of infectious disease (bacterial infection and 
sepsis), prenatal screening, in point-of-care devices, and including their application to cancer 
diagnostic assays5?. 

One of the challenges in this chapter was to develop a more straightforward FRET-based 
assay system to determine the co-location of a pair of biomarkers, which we reasoned would 
lead to higher diagnostic specificity than a single marker. As has been previously mentioned 
in 4.1.2., FRET depends on distance-dependent energy transfer between a donor and acceptor 
pair of fluorophores. Therefore, chosen marker pairs would need to both be present as abundant 
constituents of the same exosome, a characteristic we hoped would readily distinguish between 
exosomes from a model of lung cancer and from models of normal lung. The energy transfer 
associated with FRET can be determined by differences in PL, intensity, or by detection of the 
fluorescence lifetime. Time-resolved fluorescence lifetime measurements have the advantage 
that they are independent of the concentration of the fluorophores, and so were chosen as the 
basis for the assay system in this study. Quantitative FRET measurements were employed. 
Achieving this milestone required making of custom fluorescent-labeled antibodies 
corresponding to appropriate donor and acceptor pairs followed by conjugation of these custom 
labels with different sources of exosomes. The fluorophore pair AF488/AF546 was chosen as 
the ideal combination to attain this goal. 

In order to obtain high-quality FRET data, several appropriate controls were selected. 
Selected antibodies corresponding to the proposed biomarkers identified in Chapter 3 were 
directly labeled with fluorophore dyes and exploited in the experimental study. 

During the selection process six pairs of fluorophore candidates (AF488/546, AF 
488/514, AF488/532, AF647/680, AF647/700, AND AF647/750) were taken for 
consideration, given they had appropriate levels of spectral overlap. The final choice was made 
after confirming the performance of the dyes based on the distance separating the donor and 
acceptor and the brightness and stability of the fluorophores. Based on a review of literature 
that reports data analysis results of FRET pairs tested on biological systems, one out of the six 


previously mentioned FRET pairs, AF488/546, was considered optimal. It appears, that the 
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same ЕКЕТ pair had been used іп several studies where AF488 played а role аз а donor and 
AF546 was an ассеріогб25:60:65!, In the first study, the pair was successfully used to support 
nano communication by playing roles as nano transmitters and receivers creating multi-input 
multi-output FRET communication channels. The results of this study showed that the 
AF488/546 pair was the most successful pair compared to AF405/488 and АҒ405/546625, А 
second paper utilized the same fluorophore dyes to study protein-protein interactions 
suggesting that their application to in solution FRET would have value in drug discovery and 
could potentially be developed into a drug screening platform®°. A further study investigated 
bleed-through from excitation leading to cross talk in emission detection. The study pointed 
out that the concentration of the donor and acceptor should be in the same order of magnitude 
to result in accurate calculation of FRET efficiency®!. After demonstrating meeting the needs 
of FRET requirements, the AF488/546 pair was selected for conjugation with proposed 
biomarkers. 

The FRET experiment began with labeling antibodies and determination of the degree of 
labeling. First, the selection of donors and acceptors for proposed antibodies took place. 
Previously, in 4.1.2 it was said that the efficiency of FRET-based assay depends on the choice 
of FRET pairs (donor and acceptor fluorophore dyes) that are essential for the high 
performance of the assay. The choice of labeling antibodies with the donors or the acceptors 
and the pair placement was made based on the result from western blotting reported in Chapter 
3 (Fig. 35, Appendix Figs. 5 1-10), specifically, the width of the band. Because the method of 
FRET efficiency was chosen to be based on the detection of the donor fluorophore’s lifetime, 
antibodies with strong distinct bands were labeled with the donors. The biomarkers that formed 
the most distinct bands in the western blot experiment were conjugated with the acceptor 
fluorophore. That resulted in forming four pairs (donor-acceptor): RHOC-CD151, GLUT- 
MYOF, CD151-MYOF, and CD151-GLUT. 

Labeling antibodies was not the end point of this step. It was necessary to know the 
concentration of dye conjugated to antibodies as it was an essential point for selected controls 
ensuring a good quality of the assay. For that, the degree of protein labeling based on the 
antibodies and dye absorbances had to be determined. Although choosing the ideal FRET pair 
was essential, the optimization of labeled antibodies was equally important. To ensure that the 
FRET signal would be maximized, it was necessary to minimize any interference during the 
labeling process. Following that, the unbound dye was removed through the gel filtration 


process using buffer exchange columns. That was critical for the labeled antibodies to 
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recognize protein on the surface of exosomes. Use of appropriate controls and replicate 
experiments ensured that high quality data was obtained. 

FRET assays have not been previously applied to confirm exosomes by using protein 
biomarkers. Given this lack of prior art, before proceeding to test its application with the 
proposed protein biomarkers, it was first attempted using general tetraspanin EV markers 
CD63 and CD81. This was undertaken to test the specificity and sensitivity of this FRET-based 
assay. The assay was developed following several steps. First, EBC was collected and followed 
by exosome isolation (ultracentrifugation). Exosomes were then stored at 4 “С until the next 
step. Second, antibodies corresponding to the proposed biomarkers were labeled with a selected 
donor and acceptor fluorophore pair. After removal of any excess dye, exosomes were 
incubated with the labeled antibodies. Exosomes were re-isolated by SEC to pellet 
unconjugated dyes and antibodies. Third, samples containing several controls and exosome 
samples conjugated with labeled donor and acceptor were prepared and followed by lifetime 
measurements. The expectations of the detection methods were as followed. If FRET occurs, 
the lifetime of the donor will be much shorter in the FRET system than in a control where the 
dyes are not brought into close proximity. 

As described above, the first step was set up initially using the known tetraspanin EV 
markers, anti-CD63 and anti-CD81, and exosomes were conjugated with a pair of fluorescently 
labeled antibodies corresponding to these markers. At that point a positive outcome was 
expected as these markers are both known to be abundant on exosomes. The second step was 
to test the selective detection of exosome from the A549 lung cancer cell model via the use of 
the proposed biomarkers (CD151, GLUT, MYOF and RhoC). Several pairwise combinations 
of CD151, GLUT, MYOF, and RhoC were tested. At the outset of this study, it was uncertain 
whether the biomarkers proposed on the basis of the proteomic analysis in Chapter 3 would be 
present in sufficient abundance and suitably co-located in order to generate a distinct FRET 
signal. Although, the proposed biomarkers were previously validated by western blotting as 
has been shown on Figure 35, neither pairs containing GLUT provided a large reduction in 
fluorescence lifetime and failed to evidence energy transfer in our assay. However, pairs 
containing CD151 showed the highest FRET efficiencies (20.8%). 

The fact that cancer cell growth is associated with increased glucose metabolism was 
reported a long time ago and is well-documented. In addition, many studies had reported the 
overexpression of GLUT protein, especially GLUT-1, in a variety of malignancies and so is 
associated with tumor growth®?® >. One study based оп the proteomic analysis reported 


GLUT-1 and other two exosome membrane proteins as potential breast cancer biomarkers. 
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Тһе GLUT-1 was only detected in exosomes isolated from the breast cancer cell line and were 
absent in non-cancerous epithelial cell-derived exosomes. 

In contrast, the CD151-MYOF and RhoC-CD151 pairs showed high enough FRET 
efficiency to conclude that energy transfer on A549 cell-derived exosomes was successful. The 
clinical significance of CD151 gene expression predicting the behavior of lung cancer has been 
reported in several studies. CD151 gene has been reported that one of the markers associated 
with an early stage of metastasis??". The same gene was shown to be associated with poor 
outcome in respiratory disease, such as asthma, influenza, and pulmonary fibrosis, including 
lung cancer, suggesting that CD151 should be considered as a biomarker that can be used іп 
screening and also as a prognostic (0016. Furthermore, CD151 has been proposed to target 
exosomes to tissues including lung, lymph node, and stroma®?. The clinical usefulness of 
CD151 as a cancer biomarker was illustrated with in a study that demonstrated that CD151 
protein embedded on the surface of exosomes is as powerful a marker of squamous cell cancer 
and SCLC as multimarker models!!. The overexpression of CD151 was also studied in lung 
tissue samples obtained from surgically resected NSLC also suggesting the importance value 
of CD151 as a biomarker 9". In this study, we had presented results of application of CD151 as 
a putative protein biomarker for cancer diagnosis in the FRET assay in two different 
combinations, CD151-MYOF and RhoC-CD151, which were the only two combinations 
showing higher FRET efficiency and therefore successfully detected lung cancer exosomes. 

Although myoferlin has been mostly associated with breast and pancreatic cancer, there 
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is evidence that myoferlin can act as a novel target for detection of lung cancer®, and is а 


potential candidate in clinical diagnosis®!. In addition, a recent study identified its role in 
inhibition of the proliferation of cancer cells®. 

Whereas the overexpression of RhoC proteins in various types of cancer has been 
reported, the role of RhoC protein in cancer still needs to be elucidated®*. A functional study 
of lung cancer metastasis suggested that RhoC contributes to metastasis by mediating cell 
motility but does not affect tumor growth*9^. The overexpression of RhoC proteins was also а 
subject of Lu's research group, who also confirmed the effect of the protein on cell motility, 
proposing it initiates the chain of changes in the epithelia-mesenchymal transitions process, 
which then causes rearrangement in the cytoskeleton contributing to tumor metastasis?*?. In 
addition, RhoC protein belongs to the superfamily of proteins that has been known to be 
involved іп dysregulation of guanosine diphosphate (GDP)-hydrolyze guanosine 


triphosphate (GTP) cycle that plays a crucial role in cancer development$65:666, 
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To validate the hypothesis that proposed biomarkers can only be detected on cancer 
cell-derived exosomes, exosomes from WI-38 cells and human breath were tested. The 
proposed biomarkers were not evident in exosomes from these sources by western blotting or 
FRET (Figs. 35 and 62, respectively). 

It is rare that a single biomarker can provide an accurate diagnosis. The goal of 
biomarker discovery is frequently to identify several biomarkers that can be developed together 
into an assay for tumor diagnosis. There are several examples utilizing FRET with protein 


667 


cancer biomarkers to monitor disease repones in testicular?*", prostate595, breast, and ovarian 


сапсегв670, 


4.1 Limitations 


This study is based on potential biomarkers chosen for their presence іп exosomes from 
a single cell model of lung cancer. It is not known if these are general markers of lung cancer 
or simply markers of the A549 cells and so this study includes some limitations. The success 
of this experimental design was directly related to specific methodological choices. 

Firstly, one of the limitations is assay sensitivity and so the limit of detection needs to be 
established. This is required to determine the volume of EBC required in order to get detectable 
signal in a future clinical diagnostic assay. This is the most important limitation for taking 
future work into clinical trial. Briefly, in this study, the total volume of EBC collected for one 
sample was 34 mL. That required participation of 20 volunteers from which EBC was pooled. 
Given the fact that each volunteer contributed 2 mL on average into a pooled sample breathing 
for 30 min, this would add up to 10 hours of breathing if a sample were to be collected from a 
single individual. This seems almost impossible to expect from a lung cancer patient. 

Secondly, it would be highly desirable not to have to do large amounts of sample 
preparation for a lung cancer bioassay. This study shows that some form of exosome 
enrichment is required as exosome markers cannot be detected in whole breath, however a 
range of exosome enrichment strategies has not been investigated in order to see which would 
be the easiest and quickest to do as part of a diagnostic test, whilst still giving a FRET signal 
from general tetraspanin EV markers. Further work with healthy volunteers is required. 

Finally, the goal of this study was not to identify final biomarkers, but to assess whether 
there was any potential to discover and detect potential biomarkers on exosomes. There was 
no clinical validation to support the proposed biomarkers. It is certainly possible that the 


biomarkers presented in this study might be unable to distinguish between healthy individuals 
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апа those with lung сапсег. Recommendations that need to Бе taken into consideration to 
explore the full potential of breath exosomes are required using additional lung cancer models, 
and ultimately clinical studies to determine the ideal combination of exosome-based 


biomarkers. 


4.5 CONCLUSION 


The aim of the research presented in this chapter was to develop a sensitive test for the 
presence of the selected biomarkers of lung cancer found on exosomes from cell models 
identified in Chapter 3. The aim was achieved through two objectives: labeling the antibodies 
corresponding to the proposed biomarkers, and determination of the FRET efficiency of each 
proposed fluorescent labeled antibody pair by conjugating them with all sources of exosomes 
with the end goal of selecting one to two pairs using a FRET-based assay using time-correlated 
single photon counting. 

This study has shown that the FRET-based assay was able to distinguish between lung 
cancer cell-derived exosomes and exosomes isolated from breath of healthy volunteers and a 
cell model of normal lung tissue based on identified potential biomarkers of lung cancer using 
a proteomic analysis. This study confirms that data from time-resolved studies can be used for 
a sensitive test to confirm the presence of cancer biomarkers. 

Briefly, the concept of using a FRET-based test was first assessed with the general 
tetraspanin EV markers, anti-CD63 and anti-CD81. These findings suggested that this assay 
functioned satisfactorily enough to continue with proposed biomarkers. Further, this aspect of 
the study indicates that a FRET-based assay can be considered as a new way to prove the 
presence of exosomes. This step also confirmed the sensitivity of this assay. 

In this study, four proposed combinations of labeled antibodies corresponding to 
proposed biomarkers were tested. Two of these combinations, RHOC-CD151 and CD151- 
MYOF labeled with fluorophore pairs AF488/546, were effective in distinguishing exosomes 
from a model of lung cancer from exosomes isolated from a healthy lung cell model and human 
breath collected from healthy volunteers. On the other hand, the pairs GLUT-MYOF and 
CD151-GLUT showed no significant difference resulting in absence of FRET signal 
suggesting that the proposed combinations do not have a strong specificity for lung cancer, 
although their signal was previously reported through western blotting. 

The findings of this study suggest that the data from time-resolved studies can be 


applicable for a sensitive test to validate the presence of cancer biomarkers. The study 
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contributes to our understanding that exosome proteins can be used to distinguish between 
healthy and lung cancer exosomes. These results add to the rapidly expanding field of 
exosomes and their use for diagnostics, although further optimization is required as described 


in the next Chapter 5. 
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CHAPTER 5. FUTURE WORK 


The research presented in this thesis opened new opportunities and highlighted several 
topics for further research. 

In this study, the determination of a benchmark healthy lung exosome proteome was 
conducted using samples from patients in a variety of age groups, with a majority representing 
young adulthood (18 to 30 years). Future studies should draw on a range of groups as age 
diversity to ensure that the population is widely represented including geriatric patients. 
Additionally, including sex-gender variables in the research data has been recommended to 
evaluate the risk and benefits for each group®’!. In this study no distinction was made between 
proteins because samples were pooled, and it was not possible to differentiate samples. Gender 
differences and individual variations in comprehensive analysis of proteome samples have 
been previously reported?559?. Another suggestion that can be made is to increase the number 
of volunteers in order to create a larger data set and therefore provide a better insight into the 
breath proteome. 

This study investigated the breath proteome only from healthy volunteers. It will be 
interesting to see how the proteome of exosomes from lung cancer patients differs. 
Unfortunately, due to time limitations, the characterization of human breath proteome from 
lung cancer patients was not possible in this thesis. Characterization of the human lung cancer 
proteome collected from exhaled breath will be critical to verify the utility of the 
biomarkers that were proposed during this research study and validate those markers by 
proposed method of detection, the FRET-based assay. Biomarkers can identify or recognize 
the disease and indicate its stage. Therefore, characterization of the proteome of exosomes from 
the breath of cancer patients ideally needs to be conducted in patients in different age groups 
and with different stages of lung cancer to obtain a full picture. 

Several questions arising from this research should be addressed in future studies. First, 
it will be beneficial to find out if the proposed biomarkers are general for lung cancer or specific 
for A549 cells. For that, further cell models need to be tested. In this thesis, we have only used 
a healthy lung cell model (WI-38), which did behave as we expected. However, it would be 
useful to compare to other cell lines to determine the specificity of the proposed biomarkers. 
Cell models may not fully and accurately reflect the physiological condition they were chosen 
to represent. Validation of the proposed biomarkers will require different cell models. That will 
provide a linkage between the proposed biomarkers and a clinical end report evaluating its 


clinical usefulness. 
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Second, the breath collection technique is a difficult process and crucial for further 
advances in detecting biomarkers from breath. Ideally, the sample collection method should be 
adapted for personal physiology allowing collection to be simple and fast. With the current 
settings, described in this study, the collection will satisfy the goal of characterization of 
benchmark proteome from average of lung cancer patients, but will be difficult to apply on an 
individual as it will take several hours to collect the volume sufficient enough for analysis. To 
overcome these challenges, improving the performance of analytical methods that facilitates 
diagnosis on smaller sample volumes, whilst retaining a high degree of reproducibility, will be 
crucial. Third, before moving to clinical trials to test the proposed diagnostic tool on lung 
cancer patients, it is critical to establish the volume that would be necessary to perform the 
experiment on individual samples. Because in this study we collected and pooled samples, 
establishing the volume of the individual sample was not set as a goal, however it would be 
necessary to determine this for clinical use. Work with healthy volunteers is required to 
determine the limits of detection of the assay, to define the volume of EBC required for a 
clinical diagnostic assay. Once that is determined, clinical trial needs to take place. That will 
include collection of the breath from lung cancer patients following by exosome isolation. The 
putative markers first will need to be verified by western blotting before moving onto FRET 
detection. 

Meanwhile, the proteome of lung cancer exosomes needs to be characterized. This has 
not been done yet, although EBC from cancer patients has been reported. For that, firstly, 
exhaled breath condensate from lung cancer patients needs to be collected and characterized to 
establish exosome-based biomarkers that are characteristic of lung cancer. Secondly, the same 
FRET detection method would be used to verify that the biomarkers can distinguish between 
cancer and healthy exosomes. A preliminary study conducted on a sample from one individual 
gave us a rough idea of the necessary volume to be sufficient for the analysis. One of the 
suggestions that we would like to make is to conduct studies on a series of samples of 
decreasing volumes, but the same concentration of the EBC collected from healthy individuals 
without pooling them together as it will give an excellent representation of volume and 
detection abilities for already established FRET-based assay. 

Third, “multiplexed” detection involving three FRET pairs, or even multiple target 
detection using nano-assembled FRET-based systems can be used to improve sensitivity. For 
example, using exosomes in ensemble with lipids or DNA molecules. Ultimately, further 


development of a simple FRET-based diagnostic assay offers the possibility of a device that 
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сап be used for а point-of-care testing. Application to early diagnosis of lung cancer based on 
exosomes isolated from breath is especially attractive. 
Lastly, the characterization of breath exosomes can be expanded by investigating other 


molecular cargoes, such us RNA and lipids, in healthy and cancerous exosomes. 
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CHAPTER 6. GLOBAL CONCLUSION 


This thesis discusses breath-derived exosomes and their potential use in breath-based 
clinical applications for medical diagnostics for lung dysfunction, including cancer. 

Although exosomes isolated from various biological fluids have been heavily studied and 
reported, breath has been overlooked as another source of exosomesć?!672, Breath is an 
excellent source of exosomes that can be collected non-invasively, which is an advantage over 
some other biological fluids. Although there are several publications acknowledging their 
presence, breath exosomes have been poorly characterized? This study identified and set out 
to fill this knowledge gap. The project's aim was to characterize breath-derived exosomes and 
evaluate whether they can be used in clinical diagnostics. Exosomes contain parent cell- 
specific proteins, suggesting that breath exosomes could be a useful tool for distinguishing 
between healthy and cancerous lungs. This has been recently proven on plasma exosomes ^", 
Therefore, this study focused on exploring the proteome of exosomes secreted from cell models 
of normal lung tissue (WI-38 cells) and lung cancer (A549 cells). It was hypothesized that 
differentially expressed proteins could be used as biomarkers to help to identify lung 
dysfunction, including early-stage cancer. A proof-of-principle test for these biomarkers was 
developed, with a final goal to distinguish between lung cancer and healthy lung exosomes. 

This project was divided into several aims and objectives which were addressed in three 
chapters: isolation and quantification of exosomes, proteomic profiling of cell- and breath- 
derived exosomes and identification of proposed biomarkers, and detection of exosomes- 
diagnostic application. Together, these chapters show the characterization of breath exosomes 
and provide a proof-of-principle for their use as a diagnostic tool. 

Chapter 2 described the optimization of the methodology for exosome isolation. Two 
methods of isolation were discussed: ultracentrifugation and size-exclusion chromatography. 
For exosomes to be used for biomarker identification, it is important to obtain high quality 
exosomes that are free of contaminating protein aggregates or other vesicles. Experimental 
findings from this chapter revealed three key points. First, the methodology for isolation of 
high purity exosomes from EBC was established. The isolated vesicles were confirmed as 
exosomes through immunostaining with gold nanoparticles and were characterized by 
transmission electron microscopy, cryo-scanning electron microscopy, dynamic light 
scattering, and protein quantification assays. Exosomes isolated through SEC fractionation can 
be sufficiently pure and applicable for diagnostic purposes. However, the volume of biological 


fluid is a limitation of this method. On the other hand, samples that are as pure as from size 
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exclusion chromatography сап be obtained through а modified ultracentrifugation method іп 
which a large volume of the initial biological fluid is used. Second, the results suggested that 
highly characterized breath-derived exosomes are phenotypically consistent with exosomes 
from other sources. This was determined by comparing breath-derived exosomes to cell- 
derived exosomes from a lung cancer cell line and a healthy lung cell line. Finally, it was noted 
that buffers used for exosome isolation are likely to become contaminated when stored in 
labware. To avoid possible contamination, it is preferable to use freshly prepared buffers. 

The aim of Chapter 3 was to identify putative biomarkers of lung disease, evaluate 
whether those may be revealed in the proteome of exosomes, and to characterize the proteome 
of breath-derived exosomes. This was achieved through proteomic analysis of the proteome of 
cell- and breath- derived exosomes. Proteomics employs mass spectrometry to detect, identify, 
and characterize proteins” . Analysis of differential protein expression can be used to identify 
candidate biomarker proteins that are present in cancer tissues and absent in normal tissue. The 
identification of clinically useful biomarkers for lung cancer detection in early stages is a 
primary issue and a medical need?^. 

Cell-derived exosomes were used initially to optimize the protocol. The fact that the cell 
count can be easily controlled allowed an estimation of the number of proteins that would be 
sufficient for proteomic analysis. Once the protocol was optimized, proteomic analysis of 
breath-derived exosomes was attempted, beginning with a sample collected from one 
individual only. The 2 mL starting volume of EBC collection did not provide enough proteins 
for detection, after accounting for peptide loss during the purified step using C18 tips. After 
increasing the volume collected and performing extensive optimization, visible protein peaks 
were finally seen. To determine the benchmark of a healthy breath-exosome proteome we 
conducted a study in which 55 healthy participants donated their EBC. Since our goal was to 
define the average of breath proteome, single samples were not analyzed but pooled together. 
Participants were divided into three groups of 20 and EBC samples were pooled and combined 
within the group within three days. Then, breath exosomes were isolated and confirmed 
biochemically and electron microscopy. The exosomes were then subjected to in-depth 
characterization by LC-MS/MS analysis. 

The significant proteins shared across three breath proteome samples were then analyzed 
through gene ontology functional enrichment analysis to define the core of the breath exosome 
proteome. Characterization of the breath exosome proteome has never been previously 
reported. Bioinformatics techniques were used to compare the benchmark proteome of 


exosomes derived from human breath to the proteome of exosomes obtained from WI-38 cells 
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to show that these cells were a valid proxy for breath-derived exosomes. The benchmark of the 
breath exosome proteome from healthy volunteers allows for future comparison to exosomes 
obtained from conditions of lung dysfunction, including lung cancer. We foresee their use in 
non-invasive breath-based clinical applications for medical diagnostics for lung cancer as an 
important future application. Another significant finding included potential lung cancer 
biomarkers identified from cell-derived exosomes. We reported nine biomarkers, four of which 
were selected as most likely to be efficient in identifying early-stage lung cancer. Proposed 
biomarkers were validated through the western blotting, which confirmed the absence of cancer 
biomarkers on healthy lung cell-derived exosomes and breath-derived exosomes collected 
from volunteers. 

Chapter 4 combined findings from the previous two chapters and applied them into 
development of a sensitive test that should recognize proposed cancer biomarkers on lung 
cancer exosomes. The FRET-based assay was developed to detect the interaction between 
exosomes and putative lung cancer biomarkers with a final goal to distinguish between lung 
cancer and healthy lung exosomes. The assay involved a pair of antibodies corresponding to 
proposed biomarkers, one of them labeled with a donor fluorophore and the other labeled with 
an acceptor fluorophore. These were incubated with the exosomes in solution. Based on our 
assumption that proposed biomarkers are present at a low abundance in the samples, we were 
looking for a sensitive detection method. As proof-of-principle, we tested the chosen 
fluorophore pair on known general tetraspanin EV markers CD63 and CD81 which are 
embedded on the surface of exosome before moving on to antibodies corresponding to 
proposed biomarkers. Calculation of the lifetime decays of the donor in the proposed FRET 
pairs compared to controls showed that the FRET-based assay can distinguish between 
cancerous and non-tumorigenic exosomes. In conclusion, the thesis has discussed the 
application of breath-derived exosomes for non-invasive diagnosis of lung health by using the 


developed FRET-based assay with the ultimate goal of early-stage diagnosis of lung cancer. 
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Figure S 3. Western blot analysis of МҮОЕ, GLUT, апа ЕһоС proposed biomarkers оп 
all samples (breath- and cell-derived exosomes). 


Figure S 4. Western blot analysis of CD151 proposed biomarker on all samples (breath- 
and cell-derived exosomes). 


Figure S 5. Western blot analysis of general tetraspanin EV marker Alix on all samples 
(breath- and cell-derived exosomes). 


211 


Figure S 6. Western blot analysis of general tetraspanin EV markers Alix, CD63, and 
CD81 оп all samples (breath- and cell-derived exosomes). 


Figure S 7. Western blot analysis of proposed biomarkers MYOF and RhoC оп cell (WI- 
38/4549)-derived exosomes and whole proteome of the same cell line. 


Figure S 8. Western blot analysis of proposed biomarker MYOF оп cell (WI-38/A549) - 
derived exosomes and whole proteome of the same line. 
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Figure S 9. Western blot analysis of proposed biomarkers MYOF and RhoC (2 attempt) оп 
cell (WI-38/A549)-derived exosomes. 


GLUT ——— 


Figure S 10. Western blot analysis of proposed biomarker GLUT on cell (WI-38/A549) - 
derived exosomes. 
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Figure S 11. PL lifetimes of dyes and conjugated dyes. PL lifetimes obtained by time-correlated single photon counting 
(TCSPC) (dex — 380 nm; 44,7520 пт) of dyes and conjugated dyes with fluorescently labeled antibodies corresponding to the 
general tetraspanin EV markers (CD63 and CD81)and proposed biomarkers (CD151, GLUT, MYOF, and RhoC). The Y-axis 
represents normalized photon counts, and X-axis represents decay time (ns). The instrument response (IRF) is shown on each 
panel in black (dashed). The experiment was performed on three biological samples and run in duplicate. The error bars 
represent standard deviation. Some error bars are not visible as the data in the duplicates were close to identical. The figure 
was generated by using Prism GraphPad Software (version 9.1.1) for macOS, San Diego, California. 
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LIST ОЕ TABLES 


Table $ 1. Cell quantification data for А549 cell line. The table showing the cell average resulted from counted per square 
of the hemocytometer ( x10*),the number of those cells that were live or dead as revealed by trypan blue staining (х107), 
viability during passaging, and total cell count over12 days. 


Day of the experiment Cell average | Number of | Number of Viability, V of CCM, mL Total cells 
live cells dead cells % 
per square | per square 

Day 1 (T25 flask 1) 71 69 2 n/a 4 

4830000 
(T25 flask 2) 63 60 3 n/a 4 

4200000 
(T25 flask 3) 62 60 2 n/a 4 

4200000 
Day 3 (T75 flask 1) 237 235 2 99 11 

32900000 
(T75 flask 2) 138 137: 1 99 11 

19180000 
(T75 flask 3) 146 144 2 99 11 

20160000 
Day 6 (T75 flask 1) 118 116 2 98 11 

16240000 
(T75 flask 2) 119 117 2 98 11 

16380000 
(T75 flask 3) 126 125 1 99 11 

17500000 
(Т75 flask 4) 120 18 2 98 11 

16520000 
(Т75 flask 5) 275 270 s 98 11 

37800000 
(T75 flask 6) 136 133 3 98 11 

18620000 
(Т75 flask 7) 161 158 3 98 11 

22120000 
(T75 flask 8) 168 165 3 98 11 

23100000 
Day 9 (T75 flask 1) 418 413 5 99 11 

57820000 
(T75 flask 2) 421 415 6 99 11 

58100000 
T75 flask 3-5 combined) 418 413 5 99 11 

57820000 
Т75 flask 6-9 combined) 420 415 5 99 11 

58100000 
T75 flasks 10-11 combined 285 283 2 99 11 

39620000 
Day 12 ( 775 flask 1) 114 122 2 98 11 

17080000 


Table S 2. Raw list of proteins found at least once with a high quality peptide match identified in at least one sample of A549 


cell-derived exosomes. 


Protein ID Protein ID Protein ID 
D6RF35 P10643 P60033 
000299 P10809 P60174-1 
000468-5 P10909-1 P55263-1 
000560-1 P11021 P60660 
014950 Р11047 Р60709 
015230 Р11142-1 Р60842 
015427 P11166 P60900 
015484 P11413-1 P60953 
043707 Р12109 Р60981-1 
043795 P12110 P61224-1 
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060218 РІ2111 Р61247 
060701 Р12236 Р61254 
060814 Р12259 Р61353 
075083 P12814-1 P61604 
075367-1 P13473-1 P61981 
075369-1 P13639 P62241 
075955 P13645 P62258-1 
095084 P13647 P62266 
095445-1 P13929-1 P62424 
095497 P13987 P62701 
P00338-1 P14618 P62736 
P00352 P14625 P62753 
P00390 P14923 P62805 
P00488 P15121 P62820 
P00505 P15151-1 P62826 
P00533-1 P15311 P62854 
P00558 P15531 P62873 
P00734 P15559-1 P62879 
P00747 P15880 P62906 
Р01008 Р16070 Р62910 
Р01023 Р16401 Р62937 
Р01024 Р16402 Р62987 
Р01031 P17812 P63104-1 
Р01116 Р18124 Р63267-1 
Р01857 Р18206 Р68104 
Р02042 Р19338 Р68363 
Р02452 Р19823 Р68366 
Р02461 Р20742 Р68371 
Р02649 Р20774 Р69905 
Р02656 Р20908 Р78527 
Р02675 P21333 P80723 
P02679 P21399 P84103 
Р02748 Р21796 P84243 
P02749 P21926 P98160 
P02751 P22314 P99999 
P02753 P22626 Q00325-1 
P02765 P22692 Q00610-1 
P02768-1 P22695 Q00796 
P02771 P23142 Q00839 
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Р02774 P23284 Q01650 
P02786 P23396-1 Q02241-1 
P02787 P23526-1 Q02543 
P02788 P23528 Q02878 
P02794 P25705-1 Q04756 
P04004 P26006 Q06033-1 
P04075 P26038 Q06830 
P04083 P26641 Q07020 
P04114 P27105 Q07954 
P04264 P27635 Q08380 
P04406-1 P29401 Q08431 
P04745 P29692 Q12805 
P05023 P30041 Q12860 
P05026 P30048 Q13011 
P05106 P30101 Q13103 
Р05141 Р31689-1 013200 
Р05156 Р31939 013219 
Р05387 P33527 Q13418 
P05388 P34931 Q13509 
P05452 P35232 Q13885 
P05543 P35268 Q14204 
P05556-1 P35443 Q14254 
P05783 P35527 Q14533 
P05787 P35579-1 Q14764 
P06396 P35580 Q14847 
P06576 P35908 Q14974 
P06733-1 P36578 Q15063-1 
P06744 P36955 Q15084-1 
P06748 P38646 Q15365 
P06756 P39023 Q15493-1 
P07195 P39060-1 Q15582 
P07237 P40227-1 Q15758-1 
P07355 P40926 Q16658 
P07437 P42330 Q16777 
P07602-1 P43652 Q16778 
P07737 P46781 Q16853 
P07814 P48509 Q16891 
P07900 P49327 Q562R1 
P07942 P49368-1 QSQNW6-1 


217 


P07996 P49747 Q71U36 
P08123 P50552 Q71UI9-1 
P08134 P50914 Q7L7LO 
Р08195 Р50990 Q8WUA8 
P08238 P50991 Q8WUM4 
Р08697-1 Р50995 Q8WWI5 
P08729 P51570 Q969X5 
P08754 P51884 Q9H4B7 
P08758 P52209 Q9H8M2-1 
Р08865 Р52895 Q9NQHT7-1 
P09525 P53396-1 Q9NZMI 
P09651-1 P54652 Q9UGM3-1 
POCOLA-1 P55072 09Ү490 
PODMV8 P55263-1 Q9Y4K0 


Protein ID Protein ID Protein ID 
A0A087X1T7 PODMV8 Р62753 
AOA3B3IRI8 P10451-1 P62805 
A6NMY6 P10643 P62826 
A6NMZ7-1 P10909-1 Р62873 
B9A064-1 Р11047 Р62899 
C9ICIS Р11142-1 Р62910 
D6RF35 P11413-1 P62937 
E9PFT6 P11717 P62987 
G3VIN2 P11766 P63104-1 
H7BZW2 P12109 P63267-1 
O00468-5 P12110 P68366 
O00560-1 P12111 P68371 
O15031 P12259 P69905 
O15230 P12814-1 P78509 
O15484 P13639 P78524-1 
O43505 P13645 P84243 
O43707 P14550 P98160 
O43765 P14618 Q00266 
O43854-1 P14625 Q00610-1 
O60218 P15531 Q01518-1 
O75083 P15880 Q01955-1 


Table $ 3. Raw list of proteins found at least once with а high quality peptide match identified іп at least one sample of WI- 
38 cell-derived exosomes. 
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075369-1 P16070 Q02388-1 
(075955 Р16152 004721 
095466 P16402 Q04756 
095497 P17301 Q05639 
Р00338-1 Р17405-1 006033-1 
Р00352 Р18206 006210-1 
Р00558 Р19320-1 006828 
Р00734 Р19338 006830 
Р00738 Р19823 007954 
Р00742 Р19827-1 008431 
Р00747 Р20742 012805 
Р00966 Р20774 012841 
Р01008 Р20908 013103 
Р01023 Р21266 013162 
Р01024 Р21333 013200 
Р01031 Р21399 013219 
Р01033 Р21810 013308-1 
Р01266-1 Р21980 013332 
Р01344 Р22105 013418 
Р01614 Р22314 013509 
Р01857 Р22626 013614-1 
Р01861 P22692 014112-1 
Р02100 Р23142 014117 
Р02452 Р23526-1 014204 
Р02458-1 P23528 Q14520-1 
Р02461 Р23921 014624-1 
Р02545 Р24043 014764 
Р02649 Р24534 014974 
Р02656 Р24821 015063-1 
Р02671-1 Р25786-1 015274 
Р02675 Р25789 015493-1 
Р02679 Р26038 015582 
Р02748 Р26447 016270-1 
Р02749 Р27105 016610 
Р02751 Р28827-1 016777 
Р02765 P30041 016778 
Р02768-1 Р30050-1 Q16787-2 
P02771 P30086 016819 
Р02787 P30101 016853 
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P02788 Р30153 Q2TBE0 
P02790 P31146 Q5JP53 
P03956 P31939 QSQNW6-1 
P04004 Р34096 Q6BOK9 
P04040 35052 Q6NVVI 
P04075 P35080-1 QORFHS-1 
Р04083 P35443 Q71U36 
P04114 P35527 Q7Z3S9-1 
P04180 P35555 Q7Z4W1 
P04264 P35556 Q86UX7 
P04275 P35579-1 Q8N2S1 
P04406-1 P35908 Q8ND82 
P04792 Р36578 Q8NG1I-1 
P05062 P36955 Q8WUM4 
P05106 P37802 Q92520 
Р05121-1 Р38606 092626 
Р05156 Р43652 Q96F07 
P05452 P46531 Q96KP4 
Р05556-1 P47914 096112 
Р05783 Р48740-1 Q96P31-1 
P05787 P49189 Q99460 
P06396 P49327 Q99715-1 
P06727 P49368-1 Q99983 
P06733-1 P49746 Q9BQ51-3 
P06737-1 P49747 Q9BXJ4 
P06744 P50990 Q9H0U4 
P07195 P51148 Q9H2G4 
P07358 P51884 Q9H8M2-1 
P07437 Р51991-1 Q9NR21-1 
Р07585 PS52895 QONRNS-1 
Р07737 P54289 Q9NRY6 
Р07900 P55072 Q9NY65 
P07942 P55263-1 Q9NZN3 
Р07996 P55287-1 Q9P2TI-1 
P08123 P55290 Q9UBGO 
P08238 Р57721-1 Q9UBR2 
P08253 P60174-1 Q9UGM3-1 
P08670 P60660 Q9ULAO 
P08697-1 P60709 Q9UM47 
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Р08758 Р60900 090МҮ4-1 
P09382 P61160-1 Q9UNPO9-1 
P09455-1 P61224-1 Q9Y240 
P09486 P61353 Q9Y490 
P09972 P61981 Q9Y4K0 
POCOL4-1 P62191 09Ү696 
PODMV8 P62263 Q9Y6Z7 
P10451-1 P62269 U3KQF2 
P10643 P62491-1 


Table S 4. Raw list of proteins found at least once with a high quality peptide match identified in at least one sample of 
whole proteome of A549 cells. 


Protein ID Protein ID Protein ID Protein ID Protein ID 
A0A075B6Z8 P02748 P21796 P60174-1 Q16778 
A0A087WT76 P02749 P21926 P60228 Q16819 
A0A087WUL9 P02751 P21980 P60660 Q16853 
A0A087WX80 P02753 P22033 P60709 016881-1 
A0A087WYX9 P02765 P22105 P60763 Q16891 
A0A0A0MRM9 P02768-1 P22234 P60842 Q2M389 
A0A0D9SEM4 P02771 P22307-1 P60900 Q3SY69-1 
А0А0/9ҮХ25 Р02774 P22314 P60903 Q4G163 
A0A1W2PRJ5 P02786 P22626 P60953 Q4KMQ?-1 
A0A2R8Y656 P02787 P22692 P60981-1 Q53GQ0 
A0A2R8YG48 P02788 P22695 P61019-1 Q562R1 
A0A3B3IRI8 P02790 P23141 P61026 Q58FF6 
AOA3B3ISY1 P02794 P23142 P61106 QSJWF2-1 
ASA3E0 Р03951 Р23142-4 Р61129 Q5M9NO0-1 
A6NLM8 P04004 P23229-1 P61158 QSQNW6-1 
A6NMY6 P04040 P23246-1 P61160-1 055515-1 
B0QZK4 P04075 P23284 P61163 057457-1 
BIAK85 P04083 P23381 P61204 Q5T749 
B4DLNI P04114 P23396-1 P61221 05Т7Е0 
B4DT28 P04179-4 P23526-1 Р61224-1 QST8R3 
В5МСУ4 Р04180 Р23528 Р61247 QSTCZI 
В5МЕ19 Р04259 Р23634-1 Р61254 05Т630 
B7ZKQ9 P04264 P23921 P61313-1 050649 
B9A064-1 P04275 Р24043 Р61353 Q5VW38 
C9J3L8 P04406-1 P24534 P61421 Q5W0H4 
C9J5X1 P04745 P24752 P61604 Q658P3-1 
C9J7B4 P04792 P25398 P61758 Q6DD88 
C9ICIS P04843 P25445-1 P61769 Q6IAA8 
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D6RC77 P04899-1 Р25705-1 Р61970 061042 
D6RF35 P05023 P25786-1 P61978 Q6NVVI 
D6RH06 P05026 P25788-1 P61981 Q6NZD-1 
ESRHG9 P05062 P26006 P62072 Q6P2Q9 
ESRI99 P05067-1 P26038 P62081 Q6QEF8-1 
ESRIE3 Р05091 Р26373-1 Р62136-1 067МР0-1 
E7EN89 Р05106 Р26447 Р62140 Q6ZNII-1 
E7EQS8 P05121-1 P26583 P62158 Q6ZS81-1 
E7ESP4 P05141 P26599 P62191 Q71U36 
E9PCY7 Р05156 Р26641 Р62195-1 0711/9-1 
E9PKY7 P05386 P26927 P62241 Q7L576-1 
F5GY03 P05387 P27105 P62244 Q7L7LO 
F5H608 P05388 P27348 P62249 Q7LBRI 
F8WAV9 P05452 P27635 P62256 Q7Z3U7-1 
G3VIN2 P05534 P27708 P62258-1 Q7Z4W1 
G3V2MI P05543 P27797 P62266 Q86UX7 
G3V5G2 P05546 P27824 P62269 Q86VP6-1 
HOY2N6 P05556-1 P28074-1 P62277 Qs6Y82 
Н0Ү835 Р05783 Р28300 Р62280 Q8IUE6 
H0YAF8 P05787 P28331-1 P62316 Q8IUL8 
H0YB56 P06132 P29144 P62318 Q8IV33 
HOYCNI Р06396 Р29317 P62333 081783 
H0YF61 P06576 P29401 P62424 Q8IZL8 
H0YJU2 P06727 P29692 P62491-1 Q8IZP2 
H3BMNI P06733-1 P29966 P62701 Q8N0Y7 
H3BN98 P06737-1 P30040-1 P62714 Q8N2SI 
H3BPGO P06744 P30041 P62736 Q8N474 
H3BRJO P06748 P30043 P62753 Q8N7P7 
H7C106 P06753-2 P30044-1 P62805 Q8NAFO 
H7CIS9 P06756 P30048 P62820 Q8NBS9-1 
ІЗІЛАЗ Р07099 Р30050-1 Р62826 Q8NEMO 
DLIPS P07195 P30086 P62829 Q8NFJS 
13L397 P07225 P30101 P62854 QS8NGII-I 
13L425 Р07237 Р30153 Р62857 Q8NI77 
J3KND8 P07305 P30626 P62861 Q8TAA3-1 
J3KRJ1 Р07339 P30626-1 P62873 Q8TB61 
ІЗКЕХ5 Р07355 Р30838 Р62879 О8ТЕХ9-1 
ІЗКЕҮЗ Р07357 P31040 P62888 Q8WTV0-1 
J3KTD3 Р07437 P31146 P62906 Q8WUAS 
130545 Р07602-1 Р31689-1 Р62910 Q8WUM4 
J3QSF4 Р07711 Р31930 Р62913 Q8WWIS 
K7EM90 P07737 P31939 P62917 Q8WWZS-l 
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K7EMEO P07814 P31943 P62937 Q8WZ75-1 
K7EMRI P07858 P31946 P62987 Q92522 
К7ЕК96 Р07864 Р31948 Р63000-1 092539 
М00Ү96 Р07900 Р31949 Р63010-1 092542-1 
МООҮУУ9 Р07910-1 Р32119 Р63104-1 092598 
M0R3F1 P07942 P32856 P63220 Q92599-1 
N0E644 P07947 P32969 P63244 Q92608-1 
O00154 P07996 P33527 P63267-1 Q92614-1 
O00159-1 P08123 P34096 P67870 Q92616 
O00186 P08133-1 P34897-1 P67936 Q92734-1 
O00231 P08134 P34931 P68032 Q92743 
O00232-1 P08174-1 P34932 P68104 Q92859 
O00264 P08195 P35052 P68363 Q92887 
O00299 P08238 P35080-1 P68366 Q92930 
O00338 P08263 P35222 P68371 Q92954-5 
O00391 P08572 P35232 P68400 Q92973-1 
O00468-5 P08582-1 P35241-1 P69905 Q93088 
O00468-6 P08670 P35268 P78371-1 Q93099 
O00469-1 P08697-1 P35443 P78417 Q969H8 
O00483 P08708 P35527 P78527 Q969X5 
O00487 P08729 P35555 P80723 Q96AG4 
O00560-1 P08754 P35556 P82909 Q96AXI 
000567 Р08758 Р35579-1 P83731 Q96BM9 
000571 Р08779 Р35580 P84095 Q96CW1 
000764-1 Р08865 Р35590 P84098 Q96DG6 
014672 Р08962 Р35659-1 P84103 Q96EU6-2 
014744 P09211 P35908 P84243 Q96F07 
014786-1 P09382 P35998 P98088 Q96FJ2 
014828 P09429 P36268 P98160 Q96FN4 
014949 Р09486 Р36507 Р99999 Q96FW1 
014950 Р09493-1 Р36542-1 Q00325-1 Q96HC4 
014980 P09496-1 P36578 Q00610-1 Q96HU8 
015027-1 Р09525 Р36871-1 000796 096116 
015027-4 P09543-1 P36955 000839 0961Ү4 
015031 P09622 P36957 001433-1 Q96KG7-1 
O15067 P09651-1 P37802 Q01469 Q96N64 
O15127 P09936 P38117 Q01518-1 Q96N76 
O15230 P09972 P38606 Q01650 Q96QK1 
O15305 POCOL4-1 P38646 001813 Q96QV6 
015400 P0CG39 P39023 Q01955-1 096597 
015427 PODMV8 P39060-1 Q02241-1 Q99497 
O15484 P10114 P39656 Q02388-1 Q99536 
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O43143 P10301 P40227-1 Q02413 Q99623 
O43242 P10451-1 P40429 Q02543 Q99714-1 
O43252 P10606 P40925 Q02818 Q99732 
O43324-1 P10620 P40926 Q02878 099816 
043396 Р10643 Р40939 002978 099829 
043491-1 Р10768 Р41250 004721 099832 
043505 Р10809 Р41252 004756 098526 
043570 Р10909-1 Р42224-1 004837 Q9BSJ8 
O43633 Р11021 Р42330 005193-1 Q9BUJ2-1 
043657 Р11047 Р42704 005639 Q9BV38 
043707 Р11142-1 Р42766 006033-1 Q9BVK6 
043747-1 РІ1166 Р43034 006210-1 Q9BX79-1 
043795 P11169 P43243 006828 Q9BXS4 
060218 РІ1171-1 Р43490 006830 Q9BYT8 
O60462-1 P11233 P43652 Q07020 Q9C0G0 
O60488-1 P11279 P43686 Q07021 Q9H078-1 
O60610-1 P11362 Р45880 007954 Q9H0E2 
O60701 P11387 P46459 Q07955-1 Q9H0H5 
O60716-1 P11413-1 P46531 Q08209-1 Q9HIE3 
O60812 P11586 P46776 Q08211 Q9H2G4 
O60814 P11717 Р46777 008257 Q9H2U2 
O60832-1 P11766 P46778 Q08380 Q9H361 
O60841 P11908 P46779 Q08431 Q9H3N1 
O60884 P12004 P46781 Q08722 Q9H3Z4 
060888-1 P12109 P46782 008945 Q9H4B7 
O75083 P12110 P46783 Q09666-1 Q9H4F8 
O75131 P12111 P46821 Q0IIM8 Q9H4G4 
O75144-1 P12236 P46940 Q12797 Q9H4M9 
O75179-1 P12259 P47755 Q12805 Q9H583 
O75340 P12268 P48047 Q12841 Q9H6X2 
075351 Р12429 Р48059 012860 Q9H832 
075352 Р12814-1 Р48506 012907 Q9H8M2-1 
075367-1 Р12931 Р48509 012913-1 Q9H9B4 
O75369-1 P12955 P48556 Q12931 Q9HB63-1 
075382-1 Р12956 Р48740-1 012965 Q9HBH0 
O75390 P13473-1 P48740-2 013011 О9НВ11-1 
О75400 Р13489 Р49023 013057 Q9HC10 
075531 Р13497 Р49189 013103 Q9HC38 
075695 Р13639 Р49207 013162 Q9HCK1 
075746 Р13645 Р49327 013200 Q9HD45 
O75751 P13647 P49368-1 Q13219 Q9HDC9 
O75807 P13667 P49411 013418 Q9NP58 
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075874 Р13693 Р49588 Q13428-1 Q9NPAO 
O75891-1 Р13797 Р49591 013509 Q9NPH3 
O75897 P13804-1 P49720 Q13595-1 Q9NQC3 
O75955 P13929-1 P49721 Q13614-1 Q9NQHT7-1 
076003 Р13987 Р49747 013642-1 Q9NR30-1 
O94907 Р14314 Р49755 Q13748-1 QONRNS-1 
094925-1 P14550 Р49773 013813 Q9NRP0-1 
O95084 P14618 P49888 Q13885 Q9NUM4 
O95197 P14625 P50395-1 014019 QONV96 
095297-1 Р14923 Р50443 Q14112-1 Q9NY33 
095373 P14927 P50454 014117 Q9NYH9 
O95445-1 Р15121 Р50552 014126 Q9NYL9 
O95497 Р15151-1 Р50914 014152 Q9NYU2-1 
O95573 P15291 P50990 Q14160 095218 
095782 P15311 P50991 Q14204 Q9NZMI 
O95819 P15531 P50995 Q14240-1 Q9NZN3 
095994 P15559-1 P51148 Q14254 QONZN4 
096005-1 P15586 P51149 Q14332 Q9P035 
P00338-1 P15880 P51570 Q14344 Q9P0L0 
P00352 P15924-1 P51587 Q14515 Q9P0W5-1 
P00367 P16070 P51659-1 Q14520-1 Q9P2B2 
P00390 Р16152 Р51665 014533 09Р215 
Р00403 Р16401 Р51674 014624-1 Q9UBG0 
P00439 P16402 P51884 Q14697-1 Q9UBR2 
P00450 P16435 P51957-3 Q14764 Q9UGM3-1 
Р00488 Р16615 Р51991-1 014847 Q9UHDS-1 
P00505 P17301 P52209 Q14914 Q9UHN6-1 
P00533-1 P17405-1 P52272 Q14974 Q9UHV9 
P00558 P17655-1 P52815 Q14978-1 09171 
Р00734 P17812 P52895 Q14997 Q9UL25 
P00738 P17844 P52907 015019 Q9ULV4 
P00740 P17987 P53396-1 Q15029 Q9UM00-1 
P00742 P18065 P53634-1 Q15046 Q9UM47 
P00747 P18077 P53985 Q15063-1 Q9UMS4 
P00966 P18084 P53990-1 Q15084-1 Q9UMY4-1 
P01008 P18085 P53999 015102 Q9UNN8 
P01023 P18124 P54289 Q15149-1 QOUNP9-1 
Р01024 Р18206 Р54577 015185 QOUPYS 
Р01031 Р18428 Р54578 015286 09Ү223-1 
Р01042 P18621 P54652 Q15293 Q9Y230 
Р01116 Р18669 Р54709 015323 09Ү240 
Р01266-1 Р18848 Р54920 015365 09Ү265 
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Р01591 Р19022 Р55010 015366-1 Q9Y277 
P01834 P19338 P55011 Q15404-1 Q9Y2A7-1 
Р01857 Р19440 Р55058 015493-1 Q9Y2G3 
P02042 P19823 P55072 015582 Q9Y315 
P02452 P19827-1 Р55083 015758-1 Q9Y3B3-1 
P02458-1 P20020 Р55209 015836 Q9Y490 
P02461 P20073-1 P55263-1 Q15907 Q9Y4K0 
P02462 P20618 Р55290 015942 09Ү512 
P02533 P20645 P55884 QI6181 Q9Y617-1 
P02545 P20674 P56537-1 Q16394 Q9Y624 
P02647 P20742 P56539 Q16555-1 Q9Y678 
P02649 P20774 P57721-1 Q16563 Q9Y694-1 
P02656 P20908 P58397 016610 Q9Y696 
Р02671-1 Р21291 Р59190-1 016658 09Ү6С9 
Р02675 P21333 Р59998 016719 09Ү6М5 
Р02679 Р21399 Р60033 016777 


Table S 5. Raw 
38 cells. 


list of proteins found at least once with a high quality peptide match identified in at least one sample of WI- 


Protein ID Protein ID Protein ID Protein ID Protein ID 
A0A087WUV5 P09936 P53621-1 Q16775 Q99961 
A0A087WWGI P09960-1 P53634-1 Q16777 Q9BPX3 
AO0A087WYTS P09972 P53677 Q16822 Q9BPX5 
A0A087X295 РОС7Р4 Р53801 016850-1 Q9BPX7 
A0A0A0MR09 РОС7У8 Р53814-1 016851 Q9BQ39 
AOAOAOMRF6 POCG12 P53985 016881-1 09В051-1 
А0А0А0М5К5 PODJG4 P53992 Q16891 Q9BQ67 
А0А0А0М764 PODMV8 P53999 QIKMD3 Q9BQ70 
A0A0A0MTQ5 P10109 P54105 Q27J81 Q9BQAI 
A0A0C4DGA7 P10253 Р54136-1 02М218-1 Q9BQB6-1 
А0А0САрН46 Р10301 P54259 Q2TAA2 Q9BQE3 
AOAQUIRQK7 P10319 P54289 Q32MZ4-2 Q9BQES 
A0A1W2PNV4 Р10515 Р54577 Q32MZ4-4 Q9BQGO 
AOA1W2PQF8 P10599-1 P54578 Q32P28-1 098058 
A0A1W2PRK2 P10606 P54652 Q3KQU3 Q9BR61 
А0А2В8Ү4МІ Р10619 Р54687 Q3KQV9 Q9BR76 
AOA2R8YE10 P10644 P54725-1 031159 Q9BRA2 
A0A2R8YFE2 P10768 Р54727 Q3LXA3 Q9BRFS8-1 
A0A2R8YH03 P10809 PS54764 Q3MHD2 Q9BRI2 
A0A3B3ITJ4 P10909-1 PS54819 ОЗҮЕСТ-1 О9ВЕРІ 
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АОАЗВЗІТХ4 P11021 P54886 Q4G0F5 Q9BRP8 
A0A3B3IU69 P11047 P54920 Q4J6C6 Q9BRR6-1 
АОАУТІ-1 P11142-1 P55010 Q4L180 Q9BRX2 
AOFGR8-1 P11166 P55036-1 QS2LJO Q9BRZ2-1 
A0MZ66 Р11171-1 Р55060-1 QS3EPO Q9BS26 
А1Х283 P11177 P55072 053ЕА7-1 Q9BSC4-1 
A2RRPI-1 P11216 P55084 Q53GQ0 Q9BSD7 
ASYKK6 P11279 P55145 Q53GS9 Q9BSJ2-1 
A6NDG6 P11362-16 Р55209 Q53H82 Q9BSJ8 
A6NEC2 P11387 P55263-1 Q53H96 Q9BT78 
A6NHR9-1 P11388-1 P55265-1 QS3SF7-1 Q9BTCO-1 
A6NKD9 P11413-1 P55268 053759 Q9BTEI 
ATXZE4 P11586 Р55735-1 053ТҺ4 О9ВТЕЗ-1 
A8CG34 P11717 P55769 Q562R1 Q9BTE6-1 
В4рҮ09 Р11766 Р55809 056706 Q9BTTO 
BSMCF9 P11802 Р55884 Q58FG1 Q9BTU6 
В5МЕ19 Р11940-1 Р55957-1 QSBKZI Q9BTV4 
С91264 Р12004 Р56181-2 QSEBLA-1 Q9BTW9 
C9JE98 P12081 P56182 QSGLZ8-1 Q9BTY7 
C9JM79 P12109 P56192 QSH9L2 Q9BTZ2 
C9JRD2 P12110 P56199 QSH9R7-1 Q9BU23-1 
COIRIS Р12111 Р56211 QSHYI8 Q9BUFS 
D6R992 Р12236 Р56377 QSJRA6-1 Q9BUH6 
D6RBW3 P12268 P56385 QSIJRX3 Q9BUJ2-1 
D6REX3 P12270 P56537-1 Q5JSH3 Q9BULS-1 
D6RHI9 P12277 Р56539 Q5JTD0-1 Q9BUL8 
ESRHCI P12814-1 P56545 QSJTJ3-1 Q9BUQS 
ESRHF2 P12955 Р56945 051У50 Q9BUTI-1 
ESRHL7 Р12956 Р57088 Q5JWF2-1 Q9BV20-1 
ESRK69 P13010 P57740 Q5JXB2 Q9BV38 
E7EQB8 P13073 P57764 Q5QJE6 Q9BV44 
E7EVJ3 P13473-1 P57772 QSRGS4 Q9BV81 
E9PGT6 P13489 P58004 QSSRES Q9BV86-1 
E9PHYS P13639 P58546 QSSSJ5-1 Q9BVC6 
E9PMS6 P13645 P59998 QSSW79-1 Q9BVK6 
E9PS63 P13647 P60033 QSSXMS8 Q9BVP2 
ЕІТОП Р13667 Р60174-1 QSTOWO Q9BW27 
F2Z2J1 P13674-1 P60228 05Т115 Q9BW60 
Е5Н479 P13693 P60468 QSTIMS Q9BWDI 
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Е6ТВ26 P13797 Р60660 057200-1 Q9BWJ5 
F8WANI P13798 P60709 Q5T447 Q9BX95 
F8WCJ7 P13804-1 P60842 QST4B2-1 Q9BXB4 
G3VIRS P13861 Р60866 057487-1 Q9BXF6 
G5E9Z2 P13987 P60891 Q5T4US Q9BXI6 
GSEA48 P14174 P60900 057653 Q9BXJ9 
H0Y3N9 P14324 P60903 QST6F2 О9ВХР5 
HOY3T6 Р14550 Р60953 QST8D3-3 Q9BXS5-1 
Н0Ү5В0 Р14618 Р60981-1 057913 Q9BXX3 
H0Y7B8 P14618-2 P60983 QSTAH2 09ВҮ43 
HOYA82 Р14625 Р61006 QSTAP6 09ВҮ44-1 
Н0ҮВ16 P14678-1 P61009 QSTCZ1 Q9BY77 
HOYBWA P14854 P61011-1 QSTFE4 Q9BYM8 
HOYHW7 Р14868 Р61019-1 QSTFQ8 Q9BYNO 
HOYIVO P15121 P61026 QSVT25-1 Q9BYV8 
H0YJV4 P15144 P61077-1 Q5VY43 Q9BZE4 
HOYL43 Р15151-1 P61081 QSVYK3 Q9BZF9 
HOYL68 P15153 P61158 QSVYS8-1 Q9BZGI 
HOYNJO Р15170-3 Р61160-1 QSVZES Q9BZK3 
H3BMM9 P15311 P61163 QSVZLS Q9BZLA 
H3BND8 P15374 P61204 Q63HN8-4 Q9BZQ8 
H3BUZS P15407 P61218 Q63HR2 Q9BZV1 
H7BZJ3 P15531 P61221 Q63ZY3-1 Q9BZX2-1 
H7BZT7 P15559-1 P61247 066К14-1 Q9BZZ5-4 
Н7С058 Р15586 P61254 Q66K74 Q9C005 
H7C2X7 P15880 P61289-1 Q68CQ4 Q9C0C2 
H7C543 P16070 P61313-1 Q68CZ2-1 Q9C0C9 
H7C5J7 P16104 P61353 Q68D10-1 Q9C0D5 
131124 P16152 P61421 Q68EM7 Q9COES-1 
I3L4IO P16278 P61513 Q6AI12 Q9COH2-1 
ІЗКЕН6 P16333-1 P61586 060088 Q9GZMS-1 
J3QLH3 P16401 P61587 Q6DKI4 Q9GZR2-1 
J3QSF4 P16402 P61604 Q6EMK4 Q9GZR7 
J3QTA6 P16435 P61758 06Е181-1 090753 
K7EIT2 P16615 P61764 Q6FIF0 Q9GZT3-1 
K7ELL7 P16930 P61769 Q6GMV3 Q9GZT8-1 
K7EM38 P16949-1 Р61916 Q6IANO Q9GZT9-1 
К7ЕММ8 P16989-1 P61923 Q6IBSO Q9GZZ1-1 
M0QZMI P17050 P61927 061022 Q9GZZ9-1 
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M0R0U0 P17096-1 P61956 Q6L8Q7-1 Q9H074 
MOR1BO P17096-2 P61970 Q6NUK1 09Н098 
000115 Р17174 P61978 Q6NUS6 Q9H0A0 
O00116 P17252 P61981 Q6NVVI Q9HOD6-1 
000139-2 Р17301 Р62081 Q6NXE6-1 Q9H0E3 
O00148 P17302 P62136-1 Q6NYC8 09Н019 
000151 Р17405-1 Р62140 Q6NZ67 Q9H0M0-1 
O00154 P17480-1 P62158 Q6NZD-1 Q9H0Q0 
000159-1 P17612 P62191 Q6NZY4 Q9H0U4 
O00178 P17655-1 P62195-1 Q6P158-1 Q9HIB7 
000186 P17812 P62241 Q6P1J9 Q9HIE3 
O00189 P17813 P62244 Q6PIN0 Q9HIES 
000193 P17844 P62249 Q6P1X6 О9Н 8-1 
000203-1 P17858-1 P62258-1 Q6P2E9-1 Q9H1Z4 
O00214-1 P17900 P62263 Q6P2Q9 Q9H223 
O00231 P17931 P62266 Q6P2S7 Q9H267 
O00233-1 P17936 P62269 Q6P3W7 Q9H270 
O00244 P17980 P62273-1 Q6P587-1 Q9H299 
O00264 P17987 P62277 Q6P5R6 Q9H2D6 
O00267 P18031 P62280 Q6P9B6 Q9H2G2-1 
O00273-1 P18077 P62304 Q6PGP7 Q9H2J4 
O00291 P18085 P62306 Q6PIU2 Q9H2K8 
O00299 P18124 P62316 Q6PJG6-1 Q9H2M9 
000303 P18206 P62318 Q6PJT7-1 Q9H2P0 
000391 Р18583-1 P62328 Q6PKGO-1 Q9H2U1 
000410 P18615 P62333 Q6QNYO Q9H2V7 
000422 P18621 P62424 Q6RWI3 Q9H2W6 
000425 P18669 P62491-1 Q6UB35-1 Q9H3F6-1 
000468-5 Р18887 P62495 Q6UNI5-1 Q9H3H3 
000469-1 P19338 P62633-1 Q6UUV7-1 Q9H3K6-1 
000487 P19367-1 P62701 Q6UVKI Q9H3NI 
000488 Р19388 Р62750 Q6UW63 Q9H3P7 
O00499 P19474 P62753 Q6UXHS-1 Q9H3S7 
000560-1 Р19623 P62805 06УҮ07 Q9H3U1 
O00562 P19784 P62820 Q6WCQI Q9H3U5 
O00567 P19878 P62826 Q6XE24 Q9H3Z4 
000570 P20042 P62829 Q6XQN6-1 Q9H444 
000571 P20073-1 P62834 Q6XZF7 Q9H479 
000622 Р20290-2 Р62841 Q6Y7W6-1 Q9H4A3 
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014497 P20340-1 P62847-1 Q6YNI6 Q9H4A4 
O14545 P20618 P62851 Q6YP21 Q9H4G4 
O14558 P20674 P62854 Q6ZN50 Q9H4I3-1 
014562. P20700 Р62857 Q6ZN55-1 Q9H4L4 
O14579-1 P20810-6 P62861 Q6ZS17 Q9H4L5 
O14617-1 P20908 P62873 Q6ZSJ8 Q9H4M9 
O14618 P20933 P62879 Q6ZSR9 Q9H501 
O14639-1 P20962 P62888 Q6ZUT6-1 Q9H583 
014647 P21266 P62899 Q6ZVK8 Q9HSNI 
014727 P21281 P62906 Q6ZVM7 Q9H5X1 
014744 P21283 P62910 Q6ZW31-1 Q9H6B4 
014757 P21291 P62913 Q709C8 Q9H6R4-1 
O14776-1 P21333 P62917 Q70E73 Q9H6S3 
O14777 P21399 P62937 Q70UQ0-4 Q9H6X2 
O14818-1 P21589-1 P62942 Q7IRC2 Q9H6Z4 
O14828 P21675 P62979 Q71U36 Q9H7B2 
O14879 P21796 P62993 Q71UI9-1 Q9H7C4-1 
014908-1 P21810 Р63000-1 Q71UMS Q9H7C9-1 
014936 P21912 P63010-1 QUKZF4 Q9H7E9-1 
014950 P21926 P63104-1 Q7L014 Q9H7F0-1 
014964 P21964-1 P63165-1 Q7L1Q6-1 Q9H832 
014974 P21980 P63167 Q7L2J0-1 Q9H8Y8-1 
014976 P22061 P63172 Q7L576-1 Q9H910-1 
014979-1 P22087 P63173 Q7L7X3 Q9H939-1 
O14981 P22102-1 P63208 Q7L8JA Q9H944 
015027-4 р22234 Р63218 071914 Q9H992 
015031 P22307-1 P63220 Q7LBRI Q9H9A6 
015042 р22314 P63241-1 Q7RTVO Q9H9B1 
015056 р22392 P63244 0722У/4 Q9H9B4 
O15061-1 P22626 P63267-1 072392 Q9H9J2 
O15066 P22681 P63272 Q7Z3E5-2 Q9H9Q2 
O15067 P22695 Р63279 О77370 Q9H9S5 
O15085-1 P22830-1 P63313 Q7Z417 Q9HA38-1 
015126-1 р23193 Р67809 072460-1 Q9HA65 
015127 р23219 Р67870 072478 Q9HA77 
015143 р23229-1 Р67936 Q7Z4GI-1 Q9HAB3 
015144 P23246-1 P67936-2 Q7ZAH3-1 Q9HAN9 
015145 р23258 P68036 Q7ZAH8 Q9HAUA 
015160 р23284 Р68104 072417-1 Q9HAV4 
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015173 P23381 P68366 0724У5 Q9HAV7 
O15212 P23396-1 P68371 Q7Z4W1 Q9HB07 
O15231-1 P23497 P68402-1 072610-1 Q9HB71 
O15260-1 P23526-1 P69905 0726М1-1 Q9HBL0 
O15270 P23528 P78318 Q7Z6Z7 Q9HC07 
O15294 P23588 P78344-1 Q7Z739 Q9HC38 
O15305 P23634-1 P78347 Q7Z7K6-1 Q9HCC0-1 
O15347 P23743-1 P78356 0727М9 Q9HCDS 
015355 P23921 P78357 0865Е5-1 Q9HCD6-1 
O15357 P24534 P78362-1 Q86T03-1 Q9HCEI 
015371 P24593 P78371-1 Q86TB9 Q9HCN8 
O15372 P24752 P78406 Q86TC9-1 Q9HCUO 
015427 P24844-1 P78417 Q86TS9-1 Q9HDIS 
015446-1 P25054-1 P78527 Q86TX2 Q9HD20 
015460-1 P25205 P78537 Q86U42-1 Q9HD26-1 
O15484 P25325 P78559-1 Q86UE4 Q9HD33-1 
015504 P25398 P78563-1 Q86UP2-1 Q9HD42-1 
015511-1 P25685 P80723 Q86UU1-2 Q9HD45 
O15530-1 P25705-1 P82094 Q86V48-1 Q9HDC9 
O15533-1 P25786-1 P82909 Q86V81 Q9NP77 
043143 P25787 P82979 Q86VM9 Q9NP79 
043175 P25788-1 P83731 Q86VP6-1 Q9NPD3 
043237 P25789 P83881 Q86VQ1 Q9NPE3 
O43242 P26006 P84090 Q86VS8 QONPF4 
043251-1 P26038 Р84095 Q86WNI-1 Q9NPJ6-1 
043252 P26368 P84098 Q86WV6 Q9NQ92 
043290 P26373-1 P84103 Q86X27-1 Q9NQA3 
O43294-1 P26440 P84157-1 Q86X55 Q9NQC3 
O43314-1 P26447 P84243 Q86X83 Q9NQG5 
O43324-1 P26572 P85037-1 Q86XL3-1 Q9NQH7-1 
043353-1 P26583 P98082 Q86XP3-1 Q9NQR4 
043390-1 Р26599 Р98160 Q86XZ4 QONQT4 
043396 Р26639-1 P98172 Q86Y56 Q9NQW6 
043399 Р26640 P98175-1 086Ү82 Q9NQW7-1 
O43488 P26641 P98179 Q86YDI-1 Q9NQX3 
043516-1 Р26885 P98194-1 Q86YP4 Q9NQX4 
O43520 P27105 P99999 Q86YR5-1 Q9NR09 
O43583 P27348 Q00325-1 Q86YV9 Q9NRI2-1 
043615 P27361-1 Q00341-1 Q8IVOS Q9NR28-1 
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043633 P27635 Q00587-1 QSIVF2-1 Q9NR30-1 
043665 P27694 Q00610-1 Q8IVH8 Q9NR45 
O43676 P27695 Q00653-1 Q8IVL5 Q9NR46 
O43678 P27707 Q00688 QSIVL6-1 Q9NR50-1 
O43681 P27708 000765 Q8IVM0 Q9NR56-1 
043684 р27797 000796 Q8IWAS5 Q9NRA2 
O43707 P27816-1 Q00839 Q8IWE2 Q9NRL3 
O43719 P27824 Q00872 Q8IWJ2 Q9NRV9 
O43747-1 P28066-1 Q01082-1 Q8IWR0 Q9NRX2 
O43765 P28070 Q01085 Q8IWZ3 Q9NRX4 
O43795 P28072 Q01105 Q8IWZ6 QONRYS 
043813 P28074-1 Q01130 QSIWZS-I Q9NRY6 
O43815 P28290 001433-1 Q8IX01-1 Q9NS69 
O43837-1 P28300 Q01469 Q8IX12 Q9NS86 
O43852 P28331-1 Q01518-1 QSIXBI-1 Q9NSD9 
O43854-1 P28482 Q01650 QSIXD-1 QONSE4 
043896 P28799 Q01658 Q8IXJ6 Q9NSY0-3 
O60218 P28838 Q01813 Q8IXL7 Q9NSYI 
060264 P29144 Q01955-1 QSIXQ4 Q9NT62 
060271 P29218-1 Q01968-1 Q8IY63-1 QONTIS 
060353-1 р29317 001970-1 Q8IY8I QONTKS-1 
060443 р29323 001995 081Ү95 Q9NTZ6 
O60488-1 P29353 Q02543 Q8IYDI Q9NUL3 
060493-1 р29401 002790 Q8IYI6 Q9NUM4 
O60502 P29466-1 Q02809 Q8IYL3 Q9NUP9 
O60524 P29536-1 Q02818 Q8IYS2-I Q9NUQ6 
O60547-1 P29558-1 Q02878 081207 Q9NUY8 
O60551 P29590 002952-1 Q8IZ21-1 Q9NV70-1 
060565 р29692 003001-7 081273 Q9NV92 
O60568 P29692-3 Q03252 Q8IZD4 Q9NV96 
O60610-1 P29966 Q03721-1 Q8IZL8 Q9NVA2-1 
060664-1 P29992 Q04206 Q8IZP0 Q9NVD7 
O60678 P30040-1 Q04323-1 Q8IZP2 Q9NVE7 
O60684 P30041 Q04446 Q8N0X7 Q9NVG8 
O60701 P30042-1 Q04637 Q8N122 Q9NVJ2 
O60716-1 P30043 Q04760-1 Q8N163-1 Q9NVMI 
O60749 P30044-1 Q04828 Q8NIF7-1 QONVN3-1 
060762 P30048 004837 Q8NIF8 Q9NVU0 
O60763-1 P30049 Q04917 Q8N1G4 Q9NVZ3 
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060783 P30050-1 Q04941-1 Q8N2K0 Q9NWI3-1 
060784 P30084 Q05519 Q8N335 Q9NWIS-1 
O60814 P30085-1 Q05682-4 Q8N392-1 Q9NWS82 
060826 P30086 005682-6 Q8N3D4 Q9NWH9-1 
O60828 P30101 Q06124 Q8N3F8 Q9NWK9 
O60831 P30153 Q06210-1 Q8N3V7-2 Q9NWSO-1 
060832-1 P30419 006323-1 Q8N4C8-I QONWV8 
060841 P30520 006830 Q8N4L2 Q9NWY4 
O60869-1 P30530 Q07020 Q8N556-1 Q9NWZ3-I 
060884 P30533 Q07021 Q8NSN7 Q9NXO05-1 
060907 P30566 007065 Q8N668 QONX14-1 
060925 P30622-1 Q07157 08684-1 Q9NX24 
075083 P30626-1 Q07352 Q8N6H7 Q9NX40 
060925 P30520 Q07666 Q8N6MO Q9NX46 
075083 P30530 007812 Q8N6T3-1 Q9NXG2 
O75116 P30533 Q07817-1 Q8N6Y2-1 Q9NXV6 
O75131 P30566 Q07866-1 Q8N766-1 Q9NXW2 
O75153 P30622-1 Q07954 Q8N983 Q9NY33 
O75157 P30626-1 Q07955-1 QSNOI9-1 QONY93 
075165 P30837 Q07960 Q8N9N8 Q9NYB9 
O75179-1 P30838 Q08170 Q8NAF0 Q9NYF8-1 
O75312 P30876 Q08211 Q8NAX2 Q9NYJ8-1 
075347 P31040 Q08257 Q8NBF2-1 Q9NYK1 
075348 P31146 Q08345-1 Q8NBJ5 Q9NYL2-2 
O75351 P31150 Q08378 Q8NBJ7-1 Q9NYL9 
O75367-1 P31153 Q08379 Q8NBK3 Q9NYU2-1 
075368 P31350-1 Q08623 Q8NBS9-1 09М208-1 
075369-1 P31689-1 Q08722 О8МС51-1 09732 
075376-1 P31930 Q08752 Q8NCN4 Q9NZ52-3 
075379-1 P31937 Q08945 Q8NCWS Q9NZ56 
075380 P31939 Q08ADI Q8ND76 QONZB2-1 
075390 P31942 QOSAES-1 Q8NDH3 Q9NZIS 
075396 P31943 008АМ6 Q8NDII Q9NZL9 
075400 P31946 008/23 Q8NDV7 Q9NZMI 
075410-1 P31947-1 Q09328 QSNE71-1 Q9NZMS 
075427 P31948 Q09666-1 Q8NE86 QONZN4 
075436-1 P31949 QOIIMS Q8NEU8-1 QONZT2-1 
075439 P32119 Q0ZGT2 Q8NEZ2 Q9NZUS 
075508 р32321 Q10471 Q8NF91 Q9NZZ3 
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075531 р32322 010567 Q8NFD5 Q9P013 
O75533-1 P32455 Q10713 Q8NFF5 Q9P0J7 
O75534 P32969 Q12765 Q8NFH5 Q9P0K7-1 
O75569-1 P33121 Q12769 Q8NFQ8 Q9P0M6 
O75608 P33176 Q12792-2 Q8NFW8 Q9P0P0 
O75643-1 P33240 Q12797 Q8NHH9 Q9P0V3-1 
O75663 P33316-2 Q12841 Q8NHP8 Q9P1Q0 
075683 P33402 Q12846 Q8NHS3 Q9P246 
075694 P33897 012849 Q8NHV4 Q9P258 
075718 P33991 Q12873-1 Q8ND2 Q9P266 
075821 P33992 012874 Q8TAA9 Q9P270 
O75822 P33993-1 Q12884-1 Q8TAD7 Q9P299 
075828 P34897-1 012888-1 QSTAE8 Q9P2D3-1 
075832 P34932 Q12904-1 Q8TAQ2 Q9P2E9-1 
075844 р35052 012906-1 QSTAT6-1 Q9P2E9-3 
075874 P35080-2 Q12907 Q8TB52 Q9P2J5 
O75880 P35221 Q12929 Q8TB61 Q9P2N6 
O75937 р35222 012931 Q8TBC4-I Q9P2R3 
O75940 р35232 012965 QSTBXS-1 Q9P2R7 
075947-1 P3524 1-1 012972-1 Q8TCO7-1 Q9P2T1-1 
075955 P35268 Q12974 Q8TCI2 Q9P2X0-1 
075962-1 P35270 Q13017-1 Q8TCI2 Q9P2X3 
076003 P35520-1 Q13033 Q8TCS8 Q9UBB4-1 
O76021 р35527 013045-1 О8ТСТ9-1 090ВВ6 
076024 р35555 013049 Q8TD16 Q9UBB9-1 
076061 P35579-1 Q13057 О8Тр19 Q9UBC2-1 
076071 P35580 Q13098-7 QSTD43-1 Q9UBEO-1 
076074 P35606 Q13123 Q8TD55 Q9UBG0 
O76094 P35611-1 Q13131 Q8TDB4 Q9UBM7 
O76095-1 P35613 Q13136 Q8TDD1 Q9UBN7-1 
094760-1 P35626 013151 Q8TDN6 Q9UBP4 
O94776 P35637-1 Q13155 Q8TDZ2 Q9UBQ0 
O94808 P35658-1 Q13162 Q8TEA8 Q9UBQ5 
O94826 P35659-1 Q13177 Q8TEDI Q9UBQ7 
O94842 P35813 Q13200 Q8TEX9-I Q9UBR2 
O94855 P35858-2 Q13217 Q8TF05 Q9UBS4 
O94874-1 P35908 Q13243-1 Q8TF72 090858-1 
094888 P35914 013247 Q8TF74 Q9UBT2 
O94903 P35998 Q13263 Q8WTS6 Q9UBT6 
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094905-1 P36021 Q13283 Q8WU79-1 Q9UBU9-1 
094919 P36507 013308-1 Q8WU90 090076-1 
094925-3 P36542-1 Q13310 Q8WUA7 Q9UDY2 
O94973 P36543 Q13330-1 Q8WUDI Q9UDY4 
O95140 P36551 Q13347 Q8WUH6 Q9UEES 
095155 Р36578 013356 Q8WUM4 Q9UER7-1 
095159 P36776 Q13363-1 Q8WUP2 Q9UEYS-1 
095168-1 P36871-1 Q13371 Q8WUQ7 Q9UG63 
095182 P36915-1 Q13409-7 Q8WUWI Q9UGI8 
095183 Р36957 013418 Q8WVM8 Q9UGR2-2 
O95197 P37108 Q13422-2 Q8WVY7 Q9UGV2 
O95208-1 P37268 Q13423 Q8WWOI 090Н65 
095218-1 P37802 013425-1 Q8WWI2 Q9UHB6 
095232-1 P37837 Q13428-1 Q8WWII-3 Q9UHB9 
095249-1 P38117 Q13435 Q8WWMT-1 Q9UHDI 
095260 P38159-1 Q13438-7 Q8WWQO Q9UHD2 
095292 P38606 Q13439-1 Q8WWY3 Q9UHDS-1 
095297-1 P38646 Q13442 Q8WX93-1 Q9UHD9 
095302 P38919 Q13443 Q8WXFI Q9UHJ6 
095336 P38936 013444 Q8WXI9 Q9UHN6-1 
O95340 P39019 Q13492-1 Q8WYP3 Q9UHQ9 
O95347-1 P39023 Q13501-1 Q8WZ82 Q9UHV9 
O95352 P39656 Q13510 Q8WZA9 Q9UHX1-1 
095372 Р39687 013523 092499 Q9UHY1 
095373 P40121 013557-10 092508 09010-1 
095394 P40123 Q13561-1 Q92520 Q9UII2-1 
095400 P40222 Q13563 Q92522 090142-1 
095425 P40227-1 Q13564-1 Q92538 Q9UIA9 
095433 P40261 Q13573 Q92567 Q9UID3-1 
095453 P40763 Q13586 Q92572 Q9UIGO 
095456 P40855-1 Q13596 Q92574-1 Q9UIJ7-1 
095479 P40925 Q13610 Q92597 Q9UJ70 
095487-1 Р40926 013616 092598 090799 
095571 Р40939 Q13617-1 092614-1 090780 
095573 P41091 Q13618 Q92616 Q9UJU6-1 
095633 Р41208 Q13619-1 Q92621 Q9UJWO-1 
095671-1 P41214 Q13620-1 Q92626 Q9UJX4 
095674-1 P41227-1 Q13630 Q92686 Q9UJY1 
095696 Р41236 013637 092692-1 090721 
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095714 р41240 013641 092696 Q9UK41-1 
095721 P41250 Q13642 Q92734-1 Q9UK45 
O95747 P41252 Q13685 Q92783 Q9UK76 
O95757 P42166 Q13724-1 Q92785 Q9UKGI 
095782 P42224-1 Q13765-1 Q92820 Q9UKD 
095785 P42285 QI3813 Q92839 Q9UKK3 
095793 P42345 Q13823 Q92841-1 Q9UKK9 
095801 P42356 Q13838-1 Q92859 Q9UKM9-1 
095810 P42574 Q13868 Q92871 Q9UKV3-1 
095816 р42677 013884 092879 Q9UKV8 
O95817 P42696 Q13885 Q92888-1 Q9UKX7 
O95819 P42704 Q13951 Q92890-1 Q9UKY7 
O95825-1 P42765 Q13952-1 Q92896 Q9UL25 
O95831-1 P42766 Q13976 Q92900 Q9UL46 
O95861-1 P42768 Q14008 Q92917 Q9ULA0 
O95865 P43034 Q14011 Q92930 Q9ULC4-1 
095881 P43121 Q14012 Q92945 Q9ULDO 
095980 P43243 Q14019 Q92973-1 Q9ULD4 
095989 P43246-1 Q14103 Q92974 Q9ULHI-1 
096005-1 р43304 Q14108-1 Q92979 Q9ULP9 
O96019-1 P43487 Q14137 Q92995 Q9ULRO-1 
P00167-1 P43490 Q14139 Q93008-1 Q9ULTS 
P00338-1 P43686 014152 093050-1 Q9ULU4-13 
P00352 P45880 Q14157-5 Q93052 Q9ULV4 
P00367 P45954 Q14161-1 Q93062-1 Q9ULWO 
P00374 P45973 014165 Q969E2-1 Q9UMS4-6 
P00387 P45974-1 Q14192 Q969G5 Q9UMR2-1 
P00390 Р46060 014195-1 Q969H8 Q9UMS4 
P00441 P46063 Q14202 Q969M3 Q9UMX0-1 
Р00505 Р46087 014203 (09692. Q9UMYI-1 
Р00533-1 Р46108-1 014204 Q969T9 Q9UMY4-1 
P00558 P46109 Q14232 Q969V3 Q9UN37 
P00568 P46379-1 Q14240-1 0969У6 09086 
Р00750-1 Р46439 Q14247-1 Q969X5 Q9UNE7-1 
P01023 P46459 Q14254 Q96A35 Q9UNFO 
P01112 P46734-1 Q14257 Q96A49 Q9UNFI 
Р01116 Р46776 014258 096457 Q9UNH7-1 
Р01614 Р46777 014315 096465 Q9UNM6 
P01834 P46778 Q14393-1 Q96ACI Q9UNNS-1 
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Р01857 Р46779 Q14444-1 Q96AE4 Q9UNS2 
P01859 P46781 Q14498-1 Q96AG3 Q9UNWI 
P01892 P46782 Q14517 Q96AG4 Q9UNZ2 
P02452 P46783 Q14533 Q96AY3 Q9UP83-1 
P02461 P46821 Q14554 Q96B36 Q9UPN3 
P02533 Р46934 014644 096854 Q9UPUS 
P02545 P46937-1 014651 Q96B97-1 Q9UPYS 
P02751 P46939 Q14669-1 Q96BJ3-1 Q9UPZ6 
P02768-1 P46940 Q14677-1 096ВҮ6-1 090035 
Р02786 Р46976-1 Q14678-1 096С01 090080 
Р02787 Р47756-2 014683 Q96C03-2 090088-1 
Р02795 P47813 014684 096С19 Q9UQB8 
P03956 P47897 Q14690 Q96C23 Q9UQE7 
P04040 P47914 Q14692 Q96C57 Q9Y223-1 
P04075 P48047 Q14694 Q96C86 Q9Y224 
P04080 P48059-3 Q14696 Q96C90 Q9Y230 
P04083 P48147 Q14697-1 Q96CG8 Q9Y240 
P04114 P48163 Q14699 Q96CT7 Q9Y262 
P04156-1 P48444 Q14764 Q96CV9 Q9Y263 
Р04179-4 P48449-1 Q14766-1 Q96CWI Q9Y265 
P04181-1 P48506 Q14767 Q96CX2 Q9Y266 
P04259 P48509 Q14789 096015 09Ү277 
Р04264 P48556 Q147X3 Q96D46 Q9Y281-1 
P04406-1 P48634-1 Q14839 Q96EC8-1 Q9Y285 
P04632 P48637 Q14847 Q96EMO Q9Y294 
P04732-1 P48643 Q14914 Q96EPS Q9Y295 
P04792 P48681 Q14966-1 096Е45 09Ү2А7-1 
P04843 P48723 Q14974 Q96FWI1 Q9Y2BO 
P04844-1 P48735 Q14978-1 Q96FZ7 Q9Y2B9 
P04899-1 P48739 Q14980-1 Q96G03 Q9Y2DS 
P05023 P49006 Q14C86-1 Q96G23 Q9Y2G9 
P05026 P49023 Q15004-1 Q96G46 Q9Y2H5 
P05067-1 P49137 Q15006 Q96GD0 Q9Y2H6 
P05091 P49189 Q15007 Q96GK7 Q9Y2J2 
Р05114 49207 Q15008 Q96GP6 Q9Y2K2 
P05120 P49257 015018 096054 Q9Y2LI-1 
P05121-1 P49321 015019 Q96GY0 Q9Y2Q3-1 
Р05141 P49327 Q15020 Q96HC4 Q9Y2Q9 
P05198 P49368-1 Q15021 Q96HE7 Q9Y2S6 
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P05204 P49411 Q15029 Q96HY6-1 09Ү2У2 
P05362 P49419 Q15035 096124-1 Q9Y2WI 
P05386 P49458 Q15042-1 096199-1 Q9Y2W2 
P05387 P49585 Q15046 09618-1 Q9Y2X3 
P05388 P49588 Q15052 096116 09Ү2Ү0 
Р05455 P49589-1 Q15056-1 096120 Q9Y2Z0-1 
Р05556-1 P49591 Q15057 096102. Q9Y305-1 
P05783 P49720 Q15070-1 Q96J84-1 Q9Y314 
P05787 P49721 Q15075 09606-1 Q9Y320 
P05997 P49736 015084-1 Q96JH7 09Ү333 
P06132 P49748-1 015102. Q961J3 Q9Y371 
P06396-2 P49750-1 Q15113 Q96JJ7-1 Q9Y376 
P06454-1 P49756-1 Q15121 Q96JY6-1 Q9Y383 
P06493 P49770 Q15126 Q96K17-1 Q9Y388 
P06576 P49773 Q15149-1 Q96KG9 Q9Y399 
P06703 P49790 Q15149-2 Q96KP4 09ҮЗА5 
P06733-1 P49792 Q15149-4 Q96KRI Q9Y3A6-1 
P06737-1 Р49810-1 015154-1 0961.92 Q9Y3B2 
P06744 P49821 Q15155 Q96L96 Q9Y3B4 
P06748 P49841-1 015181 096117 09ҮЗВ8-1 
Р06753 P49903-1 015185 Q96M96-1 О9ҮЗСІ 
Р06753-2 P49915 Q15233 Q96MEI Q9Y3D0 
P06756 P50213-1 Q15257-1 Q96MW1 Q9Y3D6 
P07195 P50081 Q15274 Q96MWS 09ҮЗЕІ 
Р07237 P50336 Q15276 Q96MX6 Q9Y3ES 
P07305 P50395-1 Q15293 Q96N66-1 Q9Y3E7 
P07339 P50402 Q15303 Q96N67 Q9Y3F4 
P07355 P50416 Q15334 Q96P47 09Ү310 
P07384 P50454 Q15365 Q96PD2-1 Q9Y3L3-1 
P07437 P50479-1 Q15366-1 Q96PE2 Q9Y3T9 
P07585 P50552 Q15369-1 Q96PK6-1 09Ү308 
P07602-1 P50570-2 Q15393-1 Q96PK6-5 Q9Y3X0 
P07686 P50579 Q15404-1 Q96PRI Q9Y3Y2 
P07711 PS50613 Q15417 096Р05 09Ү450-1 
Р07737 P50750 Q15424-1 096Р08 Q9Y490 
P07741-1 P50897 Q15427 Q96QK1 09Ү4В5 
Р07814 P50914 Q15434 Q96QZ7 09Ү4Е1 
Р07858 P50990 Q15435 Q96RE7 Q9Y4E8 
P07900 P50991 Q15436 Q96RF0-2 Q9Y4F1 
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P07910-1 P50995 015459 096544 Q9Y4II-1 
P07919 P51114-1 Q15477 Q96S52 Q9Y4K0 
P07942 PS51116 Q15532 Q96859-1 Q9Y4K4 
P07948-1 P51148 Q15555 096566 Q9Y4L1 
Р07951-3 Р51149 015572-1 096582 Q9Y4P1-1 
P07954 P51153 Q15599 096597 Q9Y4W2 
P07992-1 P51397 Q15629 Q96SB3 Q9Y4W6 
P07996 P51452 Q15637 Q96SB4-3 Q9Y4X5 
P08123 P51570 Q15642 Q96SL4 Q9Y520-7 
P08133-1 P51572 Q15643 Q96SZ5 Q9Y570 
P08134 P51610-1 Q15654-1 Q96T37-1 Q9Y5A9-1 
Р08195 Р51659-1 015691 Q96T51 09Ү5В9 
P08237 P51665 Q15717 Q96T60 09Ү5Л 
P08238 P51812 Q15746-1 Q96176-1 Q9Y5J9 
P08240-1 P51858 015758-1 Q96TA1 09Ү5К8 
P08243-1 P51884 Q15785 Q96TC7 Q9YSLO-1 
P08572 Р51911 Q15796-1 Q99417 09Ү5М8 
Р08574 Р51991-1 015811 099426 09Ү551 
Р08579 P52209 015813 099436 09Ү552 
Р08621-1 р52272 015819 099439 09Ү5Ү5 
Р08648 р52292 015836 Q99447-1 09Ү577 
P08670 P52306-1 015843 099459 09Ү608 
P08708 P52565-1 Q15904 Q99460 Q9Y617-1 
P08727 P52594-1 Q15942 Q99470 Q9Y657 
P08729 P52597 016181 Q99471-1 Q9Y678 
P08758 P52630-3 Q16204 Q99497 Q9Y680-2 
Р08865 P52701 Q16222-1 099536 09Ү696 
Р09012 P52788-1 Q16270-1 Q99543-1 09Ү6С9 
Р09067 P52789 016352 099567 Q9Y6D5 
P09104 P52815 Q16512-1 Q99575 Q9Y6D9 
P09110-1 P52895 Q16527 Q99614 Q9Y6E0 
P09211 P52907 Q16531 Q99623 Q9Y6G9 
P09326 P52926-1 Q16540 Q99653 Q9Y613 
P09382 P52943 Q16543 Q99700-1 09Ү619 
Р09417 P52948-1 Q16555-1 Q99707 Q9Y6K8 
P09429 P53004 Q16576-1 Q99714-1 Q9Y6K9 
Р09493-10 Р53041 016610 099715-1 Q9Y6MI 
P09496-2 P53367 Q16626 Q99729-2 Q9Y6MS 
P09497-2 P53396-1 Q16629 Q99733 Q9Y6N5 
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Table S 6. GO analysis of A549 cell-derived exosomes. The analysis was performed by using g:Profiler sofiware. 


P09525 P53582 Q16630 Q99798 Q9Y6R0 
P09543-1 P53597 Q16637 Q99805 Q9Y6WS 
P09622 P53602 Q16643 Q99816 Q9Y6Y8 
P09651-1 P53611 Q16658 Q99829 S4R3G0 
Р09661 P53618 Q16666-1 Q99832 S4R3N1 
Р09669 P53611 Q16698 Q99848 X6RLX0 
P09874 P53618 Q16774-1 Q99879 


*The list of proteins associated with each ontology is available on request deanna.shea@vuw.ac.nz 


Source Term name Term ID Adjusted Negative 10210 Term | Query | Inter | Effective 
p-value of adjusted size size secti | domain 
p-value on | size 
size 
GO:MF | cell adhesion molecule binding GO:0050839 1.521742670716 | 43.817658781295 | 548 301 74 18694 
54E-44 00 
СО:МЕ | structural molecule activity GO:0005198 4.896627476702 |38.310102934774 | 698 301 76 18694 
85E-39 20 
СО:МЕ | protein-containing complex GO:0044877 2.667191990529 |25.573945721683 | 1282 |301 83 18694 
binding 75E-26 00 
GO:MF | cadherin binding GO:0045296 6.815619888626 | 25.166494638440 | 332 301 45 18694 
2E-26 20 
GO:MF | RNA binding GO:0003723 3.083876991424 |24.510902953257 | 1946 |301 101 |18694 
21E-25 60 
СО:МЕ |enzyme binding GO:0019899 2.951197890109 |19.530001668269 | 1866 |301 91 18694 
94E-20 800 
СО:МЕ | integrin binding GO:0005178 2.947859242009 | 17.530493257542 | 148 301 27 18694 
29Е-18 300 
СО:МЕ | structural constituent of ribosome | GO:0003735 1.282677879149 | 15.891882394967 | 170 301 27 18694 
99E-16 400 
GO:MF | extracellular matrix structural GO:0005201 1.650944100266 |13.782267631374 | 169 301 25 18694 
constituent 37E-14 90 
СО:МЕ | unfolded protein binding GO:0051082 3.365118365701 | 13.472999655164 | 111 301 21 18694 
82E-14 500 
СО:МЕ | signaling receptor binding GO:0005102 3.25033508142Е | 11.488071864710 | 1681 |301 72 18694 
412 300 
СО:МЕ | protease binding GO:0002020 3.884661412038 | 11.410646828418 | 139 301 21 18694 
77Е-12 100 
СО:МЕ | organic cyclic compound binding | GO:0097159 1.304944975246 | 9.8844078005483 | 6387 |301 164 | 18694 
65E-10 50 
GO:MF | carbohydrate derivative binding | GO:0097367 4.883418067658 | 9.3112760943304 | 2286 |301 82 18694 
85E-10 10 
GO:MF | small molecule binding GO:0036094 1.291021948369 | 8.8890663743292 | 2507 |301 86 18694 
98E-09 90 
СО:МЕ | heterocyclic compound binding | GO:1901363 5.536373022393 | 8.2567746562483 | 6302 | 301 158 |18694 
27E-09 1 
СО:МЕ | identical protein binding GO:0042802 1.353008781762 | 7.8686993845860 |2080 |301 74 18694 
07Е-08 90 
СО:МЕ | protein binding GO:0005515 1.908188168118 | 7.7193788013581 | 14752 | 301 279 |18694 
95Е-08 60 
СО:МЕ | MHC class II protein complex GO:0023026 2.300344792476 | 7.6382070638749 | 16 301 8 18694 
binding 15Е-08 90 
СО:МЕ | peptidase regulator activity GO:0061134 5.183640230762 | 7.2853651481900 | 225 301 21 18694 
04E-08 80 
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СО:МЕ | structural constituent of GO:0005200 | 5.721917470651 | 7.2424584104768 | 104 |301 15 |18694 
cytoskeleton 1E-08 10 
СО:МЕ | lipoprotein particle receptor GO:0070325 | 7.701963771027 | 7.1133985285696 |26 |301 9 18694 
binding 53E-08 70 
GO:MF | endopeptidase inhibitor activity | GO:0004866 | 3.447913443158 | 6.4624436452924 |179 |301 18 |18694 
08E-07 10 
СО:МЕ | nucleoside-triphosphatase GO:0017111 |3.914263937615 | 6.4073498933471 | 840 |301 40 |18694 
activity 36E-07 00 
СО:МЕ | peptidase inhibitor activity GO:0030414 | 5.398648346759 | 6.2677149603664 | 184 |301 18 |18694 
84Е-07 20 
СО:МЕ | nucleoside phosphate binding | GO:1901265 | 6.636826790470 | 6.1780395165337 |2173 |301 72 | 18694 
228-07 30 
СО:МЕ | endopeptidase regulator activity | GO:0061135 | 7.646603782511 |6.1165314126915 | 188 |301 18 |18694 
448-07 40 
СО:МЕ | MHC protein complex binding | GO:0023023 | 0.000001175135 | 5.9299120829416 |24 |301 8 18694 
4211733100 80 
СО:МЕ | nucleotide binding GO:0000166 | 0.000001598688 | 5.7962361672638 |2172 |301 71 | 18694 
4330650600 50 
СО:МЕ | ubiquitin protein ligase binding | GO:0031625 | 0.000001824870 | 5.7387678918396 | 300 |301 22 | 18694 
741701890 50 
СО:МЕ | enzyme inhibitor activity GO:0004857 | 0.000001963738 | 5.7069163646323 |386 |301 25 | 18694 
4123587700 00 
СО:МЕ | pyrophosphatase activity GO:0016462 | 0.000002253355 | 5.6471702893699 | 893 |301 40 |18694 
4852308200 400 
СО:МЕ ctivity, acting on acid | GO:0016817 | 0.000002555506 | 5.5925229326974 | 897 |301 40 |18694 
955266510 90 
СО:МЕ ctivity, acting on acid | GO:0016818 | 0.000002555506 | 5.5925229326974 | 897 |301 40 |18694 
іп phosphorus- 955266510 90 
containing anhydrides 
СО:МЕ | protein domain specific binding | GO:0019904 | 0.000002724960 | 5.5646397886512 | 718 |301 35 |18694 
500373840 00 
СО:МЕ | nucleic acid binding GO:0003676 | 0.000003719128 | 5.4295588296280 | 4353 |301 114 |18694 
3835295900 90 
СО:МЕ | ubiquitin-like protein ligase GO:0044389 | 0.000005305132 | 5.2753037847471 |318 |301 22 | 18694 
binding 255959490 
СО:МЕ | anion binding GO:0043168 | 0.000005586081 | 5.2528927385785 | 2424 |301 75 |18694 
4193011400 30 
СО:МЕ | collagen binding GO:0005518 | 0.000006070143 | 5.2168010559806 |69 |301 11 |18694 
3039277000 900 
СО:МЕ | lipid binding GO:0008289 | 0.000006479860 | 5.1884343427347 | 779 |301 36 [18694 
513284850 60 
СО:МЕ | actin binding GO:0003779 | 0.000007985520 | 5.0976967557379 | 445 |301 26 |18694 
782161870 00 
СО:МЕ | purine ribonucleoside GO:0035639 | 0.000008300102 | 5.0809165685059 | 1850 | 301 62 | 18694 
triphosphate binding 038947060 40 
СО:МЕ | low-density lipoprotein particle | GO:0050750 | 0.000012175050 | 4.9145292180912 | 21 301 7 18694 
receptor binding 798285000 50 
СО:МЕ | kinase binding GO:0019900 | 0.000012623682 | 4.8988139274450 | 763 |301 35 | 18694 
717688400 50 
СО:МЕ | purine nucleotide binding GO:0017076 | 0.000018300640 | 4.7375337111483 | 1935 |301 63 [18694 
461246 80 
СО:МЕ | double-stranded RNA binding | GO:0003725 | 0.000019837867 | 4.7025050098761 | 77 301 11 |18694 
724024400 90 
СО:МЕ | actin filament binding GO:0051015 | 0.000026994362 | 4.5687269285653 |210 |301 17 |18694 
240371300 80 
СО:МЕ | disordered domain specific GO:0097718 | 0.000032239423 | 4.4916127381738 | 35 301 8 18694 
binding 09952750 20 
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GO:ME | protein dimerization activity GO:0046983 | 0.000036826815 | 4.4338358309606 | 1068 | 301 4 | 18694 
7552572 10 
СО:МЕ | cytoskeletal protein binding GO:0008092 | 0.000043636249 | 4.3601525808836 | 995 |301 40 |18694 
78655650 40 
GO:ME | heparin binding GO:0008201 | 0.000045841879 | 4.3387375803594 | 168 |301 15 |18694 
96487140 20 
СО:МЕ | chaperone binding GO:0051087 | 0.000054666810 | 4.2622762648959 | 104 |301 12 |18694 
387509600 80 
СО:МЕ | misfolded protein binding GO:0051787 | 0.000064275506 | 4.1919544941790 |26 |301 7 |18694 
2322953 50 
СО:МЕ | protein folding chaperone GO:0044183 | 0.000064275506 | 4.1919544941790 |26 |301 7 |18694 
2322953 50 
СО:МЕ | purine ribonucleotide binding | GO:0032555 | 0.000077390581 | 4.1113118899867 | 1921 |301 61 |18694 
52220150 10 
СО:МЕ | ribonucleotide binding GO:0032553 | 0.000105342420 | 3.9773967052567 | 1938 | 301 61 |18694 
89493600 7 
СО:МЕ | extracellular matrix structural | GO:0030020 | 0.000120351870 | 3.9195471550665 | 41 |301 8 |18694 
constituent conferring tensile 6376760 000 
strength 
СО:МЕ | sulfur compound binding GO:1901681 | 0.000147038043 | 3.8325702836318 |264 |301 18 |18694 
85375700 60 
СО:МЕ | glycosaminoglycan binding GO:0005539 | 0.000154335077 | 3.8115353561583 |237 |301 17 |18694 
3154370 50 
СО:МЕ | growth factor binding GO:0019838 | 0.000164425585 | 3.7840306022987 | 137 |301 13 |18694 
76355200 20 
СО:МЕ | protein heterodimerization GO:0046982 | 0.000187397671 | 3.7272358085949 |358 |301 21 |18694 
activity 98467700 200 
СО:МЕ | exogenous protein binding GO:0140272  |0.000207263882 | 3.6834763713383 |77 |301 10 |18694 
2584040 20 
СО:МЕ | platelet-derived growth factor | GO:0048407 | 0.000216888143 | 3.6637641890870 | 11 |301 5 | 18694 
binding 28913100 10 
СО:МЕ | protein kinase binding GO:0019901 | 0.000295636283 | 3.5292422662058 | 679 |301 30 |18694 
3311300 700 
СО:МЕ | oxidoreductase activity, acting on | GO:0016616 | 0.000412460259 | 3.3846178895145 | 125 |301 12 |18694 
the CH-OH group of donors, 31831000 400 
NAD or NADP as acceptor 
СО:МЕ | antioxidant activity GO:0016209 | 0.000655979008 | 3.1831100577000 |87 |301 10 |18694 
8610940 30 
GO:MF | GTPase activity GO:0003924 | 0.000781850703 | 3.1068761689402 |327 |301 19 |18694 
1137840 100 
СО:МЕ | proteoglycan binding GO:0043394 | 0.000864303094 | 3.0633339322211 |37 |301 7 |18694 
2085160 700 
СО:МЕ | oxidoreductase activity, acting on | GO:0016614 | 0.001096296591 | 2.9600719362793 | 137 |301 12 |18694 
CH-OH group of donors 188710 20 
СО:МЕ | ADP binding GO:0043531 | 0.001256116573 | 2.9009700542256 |39 |301 7 |18694 
3302300 00 
СО:МЕ | extracellular matrix binding GO:0050840 | 0.001261388260 | 2.8991512151776 |55 |301 8 |18694 
6225500 20 
СО:МЕ | enzyme regulator activity GO:0030234 | 0.001464910640 | 2.8341888665195 | 1228 | 301 42 |18694 
2680000 30 
СО:МЕ | virus receptor activity GO:0001618 | 0.001734914274 | 2.7607219796317 |76 |301 9 |18694 
696150 30 
СО:МЕ | scrine-type endopeptidase GO:0004867 | 0.001965121096 | 2.7066106818781 |98 |301 10 |18694 
inhibitor activity 914740 800 
СО:МЕ | retinal dehydrogenase activity | GO:0001758 | 0.002127782585 | 2.6720727498256 | 8 301 4 | 18694 
5984300 2 
СО:МЕ | fibronectin binding GO:0001968 | 0.002249625451 | 2.6478897831752 |28 |301 6 |18694 


4824200 


800 
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СО:МЕ | phospholipid binding GO:0005543 0.002425686419 | 2.6151653432241 | 455 301 22 18694 
587960 600 

GO:MF | lipase inhibitor activity GO:0055102 0.002683019988 | 2.S713760918267 | 17 301 Š 18694 
4703400 40 

GO:MF | nucleoside binding GO:0001882 0.003350413337 | 2.4749016110926 | 395 301 20 18694 
832490 100 

GO:MF | heat shock protein binding GO:0031072 0.003493219945 | 2.4567740688210 | 128 301 11 18694 
191660 300 

СО:МЕ GO:0016491 0.003827816558 | 2.4170488832032 | 771 301 30 18694 
557590 20 

GO:MF | GTP binding GO:0005525 0.007315819876 | 2.1357369958609 | 382 301 19 18694 
6506900 700 

GO:MF | ATPase activity GO:0016887 0.008315285370 | 2.0801228417034 | 456 301 21 18694 
995490 000 

GO:MF | MHC class | protein binding GO:0042288 0.008368422728 | 2.0773563896364 | 21 301 5 18694 
956100 70 

GO:MF | purine ribonucleoside binding GO:0032550 0.008456241844 | 2.0728226049043 | 386 301 19 18694 
553530 80 

GO:MF | purine nucleoside binding GO:0001883 0.009413540588 | 2.0262470005109 | 389 301 19 18694 
267930 000 

СО:МЕ | ribonucleoside binding GO:0032549 0.009413540588 | 2.0262470005109 | 389 301 19 18694 
267930 000 

GO:MF | intramolecular oxidoreductase GO:0016860 0.010382340855 | 1.9837047172043 | 53 301 7 18694 
activity 381200 100 

GO:MF | phospholipase inhibitor activity | GO:0004859 0.014297814108 | 1.8447303536805 | 12 301 4 18694 
034400 700 

GO:MF | alditol:NADP+ 1-oxidoreductase | GO:0004032 0.014297814108 | 1.8447303536805 | 12 301 4 18694 
activity 034400 700 

GO:MF | guanyl nucleotide binding GO:0019001 0.015291142463 | 1.8155600654619 | 403 301 19 18694 
671 500 

GO:MF | guanyl ribonucleotide binding GO:0032561 0.015291142463 | 1.8155600654619 | 403 301 19 18694 
671 500 

GO:MF | MHC protein binding GO:0042287 0.019485516852 | 1.7102880700791 | 40 301 6 18694 
358300 300 

GO:MF | C-X3-C chemokine binding GO:0019960 0.019588891401 | 1.7079901414411 | 5 301 3 18694 
190600 500 

GO:MF | peptide disulfide oxidoreductase | GO:0015037 0.020390896101 | 1.6905636882296 | 13 301 4 18694 
activity 492400 100 

СО:МЕ | adeny! nucleotide binding GO:0030554 0.029715190353 | 1.5270214833337 | 1580 |301 46 18694 
635700 900 

GO:MF | rRNA binding GO:0019843 0.035977948560 | 1.4439636034708 | 64 301 7 18694 
985500 300 

СО:МЕ | tau protein binding GO:0048156 0.038649890354 | 1.4128517337919 | 45 301 6 18694 
305800 000 

GO:MF | C3HC4-type RING finger GO:0055131 0.038710026597 | 1.4121765302765 | 6 301 3 18694 
domain binding 71820 400 

GO:MF | hydrolase activity GO:0016787 0.039110578770 | 1.4077057571748 | 2588 |301 66 18694 
566800 50 

GO:MF | ATP binding GO:0005524 0.039222139295 | 1.4064687222689 | 1504 |301 44 18694 
79040 800 

GO:BP | establishment of localization GO:0051234 9.762527764983 | 28.010437717963 | 5398 |301 188 | 18092 
83E-29 600 

GO:BP | biological prox involved in GO:0044403 2.351404444275 | 27.628672665298 | 996 301 78 18092 
symbiotic interaction 2E-28 600 

GO:BP | transport GO:0006810 6.163438833998 | 27.210176909601 | 5259 |301 184 | 18092 
53E-28 90 

GO:BP | establishment of localization in | GO:0051649 1.173972688485 | 26.930342006476 | 2840 |301 131 |18092 
cell 62E-27 800 
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GO:BP | viral process GO:0016032 | 8.655101248629 |26.062727847319 |940 |301 74 |18092 
61E-27 00 
GO:BP | localization GO:0051179 | 5.218218002912 | 25.282477781218 | 6935 |301 211 |18092 
55Е-26 10 
GO:BP | cellular localization GO:0051641 | 3.328578905879 | 24.477741143389 |3562 |301 144 |18092 
9E-25 500 
GO:BP | platelet degranulation GO:0002576 | 3.197477743748 |23.495192469773 |127 — |301 31 |18092 
68Е-24 700 
GO:BP | regulated exocytosis GO:0045055 | 8.415692153848 | 23.074910159398 | 799 [301 65 |18092 
97E-24 70 
GO:BP GO:0044419 | 6.088408208870 | 22.215496237164 | 2272 |301 109 |18092 
728-23 40 
organisms 
GO:BP | exocytosis GO:0006887 | 1.307104873822 | 21.883689565980 |921 |301 68 |18092 
94E-22 800 
GO:BP | SRP-dependent cotranslational | GO:0006614 | 1.307369097706 | 21.883601784670 |96 |301 27 |18092 
protein targeting to membrane 77E-22 700 
СО:ВР | cotranslational protein targeting | GO:0006613 | 5.970872864851 | 21.223962175959 | 101 |301 27 | 18092 
to membrane 1Е-22 600 
СО:ВР | vesicle-mediated transport GO:0016192 | 9.205291357999 |21.035962461025 |2193 | 301 105 | 18092 
258-22 400 
GO:BP | protein targeting to ER GO:0045047 | 1.000968306499 | 20.999579673300 | 114 |301 28 |18092 
6E-21 100 
GO:BP shment of protein GO:0072599 | 2.833969780405 | 20.547604785095 |118 [301 28 |18092 
ization to endoplasmic 59Е-21 300 
reticulum 
GO:BP | export from cell GO:0140352 | 5.223811674008 | 20.282012488397 | 1450 |301 83 [18092 
43E-21 100 
GO:BP | establishment of protein GO:0072594 | 6.043541589298 | 19.218708484900 | 589 |301 52 |18092 
localization to organelle 07E-20 300 
GO:BP | nuclear-transcribed MRNA GO:0000184 | 1.854214678238 | 18.731839985306 |123 |301 27 |18092 
catabolic process, nonsense- 06E-19 500 
mediated decay 
GO:BP | secretion by cell GO:0032940 | 1.917682371657 | 18.717223323995 | 1396 |301 79 |18092 
53E-19 300 
GO:BP | secretion GO:0046903 | 1.087291023253 | 17.963654197517 | 1538 | 301 82 | 18092 
26Е-18 400 
СО:ВР | wound healing GO:0042060 | 1.110472477338 | 17.954492200814 | 553 | 301 49 |18092 
41E-18 900 
GO:BP | protein localization to GO:0070972 | 1.815505999008 | 17.741002311701 | 147 |301 28 |18092 
endoplasmic reticulum 4Е-18 800 
GO:BP | hemostasis GO:0007599 | 1.009341663569 | 16.995961799584 |352 |301 39 |18092 
94E-17 3 
GO:BP | coagulation GO:0050817 | 1.371794276822 | 16.862711013366 |355 |301 39 |18092 
35Е-17 80 
GO:BP | cell activation GO:0001775 | 2.498941804528 | 16.602243857625 | 1510 |301 79 |18092 
22E-17 700 
GO:BP | regulation of biological quality | GO:0065008 | 2.581381504088 | 16.588147806042 |4194 |301 143 |18092 
28E-17 60 
GO:BP | blood coagulation GO:0007596 | 5.063908016954 | 16.295514191634 | 347 |301 38 | 18092 
8E-17 400 
GO:BP | protein localization to membrane | GO:0072657 | 6.326037223197 | 16.198868256969 | 660 |301 51 |18092 
13E-17 50 
СО:ВР | protein targeting to membrane | GO:0006612 | 1.229758642370 | 15.910180116668 | 203 |301 30 |18092 
73Е-16 000 
GO:BP | establishment of protein GO:0090150 | 2.518003061099 | 15.598943746262 |342 |301 37 | 18092 
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GO:BP | response to stress GO:0006950 2.688479452490 | 15.570493278399 | 4199 |301 141 |18092 
71E-16 800 
GO:BP | response to organic substance GO:0010033 3.527528451251 | 15.452529474761 | 3459 |301 125 |18092 
35Е-16 000 
GO:BP | protein targeting GO:0006605 6.342479706108 |15.197740913706 | 441 301 4l 18092 
2E-16 40 
GO:BP | response to chemical GO:0042221 6.465924754590 |15.189369353676 |4834 |301 153 |18092 
3E-16 900 
GO:BP GO:0045184 6.713259572646 |15.173066560205 |2044 |301 91 18092 
13E-16 500 
GO:BP | translational initiation GO:0006413 9.053775788168 | 15.043170264774 | 200 301 29 18092 
78Е-16 10 
GO:BP | response to wounding GO:0009611 1.462660702002 | 14.834856406846 | 682 301 50 18092 
02Е-15 80 
GO:BP | organonitrogen compound GO:1901564 1.887055654885 | 14.724215291009 | 6780 | 301 188 | 18092 
metabolic process 72E-15 800 
GO:BP | protein localization to organelle | GO:0033365 2.187337181536 | 14.660084264610 | 1014 |301 61 18092 
38E-15 600 
GO:BP | cellular component organization | GO:0071840 4.215516251297 | 14.375149232104 | 6767 |301 187 |18092 
or biogenesis 2Е-15 400 
GO:BP | cellular response to chemical GO:0070887 5.994677375448 | 14.222234185014 | 3438 |301 122 |18092 
stimulus 83E-15 300 
GO:BP | viral transcription GO:0019083 6.522404324043 | 14.185592282767 | 180 301 27 18092 
03Е-15 400 
СО:ВР | regulation of body fluid levels GO:0050878 4.000108946118 | 13.397928180158 | 520 301 42 18092 
13E-14 600 
GO:BP | protein transport GO:0015031 5.125508426212 | 13.290263048175 | 1946 |301 85 18092 
96E-14 300 
GO:BP | peptide transport GO:0015833 5.788456708118 | 13.237437210450 | 1989 |301 86 18092 
94Е-14 200 
GO:BP | viral gene expression GO:0019080 6.982986476026 | 13.155958799057 | 197 301 27 18092 
38Е-14 800 
GO:BP | anion transport GO:0006820 7.550939311493 | 13.121999020235 | 2914 | 301 108 | 18092 
25Е-14 200 
GO:BP | tissue development GO:0009888 8.571304385815 | 13.066953081874 | 2161 |301 90 18092 
72Е-14 300 
GO:BP | organic substance transport GO:0071702 8.971843576426 | 13.047118306922 | 2790 |301 105 | 18092 
Е-14 200 
GO:BP | nitrogen compound transport GO:0071705 1.647331186075 | 12.783219079746 | 2347 |301 94 18092 
45Е-13 000 
GO:BP | amide transport GO:0042886 1.688346689144 | 12.772538369486 | 2024 | 301 86 18092 
39Е-13 100 
GO:BP | neutrophil mediated immunity GO:0002446 3.049030979453 | 12.515838162995 | 498 301 40 18092 
69Е-13 30 
GO:BP | intracellular transport GO:0046907 3.758327719192 | 12.425005352912 |1777 |301 79 18092 
22Е-13 600 
GO:BP | nuclear-transcribed mRNA GO:0000956 6.674010543622 | 12.175613111580 | 215 301 27 18092 
catabolic process 14E-13 900 
GO:BP | cellular component biogenesis GO:0044085 7.110809218116 |12.148080973255 | 3276 |301 114 |18092 
25Е-13 500 
GO:BP | macromolecule localization GO:0033036 8.463279304102 | 12.072461326355 | 3238 | 301 113 | 18092 
37E-13 400 
GO:BP | cellular protein localization GO:0034613 1.784919430815 | 11.748381382656 |2024 |301 84 18092 
05Е-12 600 
GO:BP | cellular macromolecule GO:0070727 2.453350079383 | 11.610240475969 | 2035 |301 84 18092 
localization 67E-12 500 
GO:BP | cellular component organization | GO:0016043 2.967765461236 | 11.527570423782 | 6576 |301 177 | 18092 
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GO:BP | neutrophil degranulation GO:0043312 | 3.954496535148 | 11.402908800643 |484 |301 38 |18092 
2Е-12 000 

СО:ВР |mRNA catabolic proc GO:0006402 | 4.587334112297 | 11.338439627462 |386 |301 34 |18092 
5E-12 800 

СО:ВР | ion transport GO:0006811 | 4.792980954374 | 11.319394296738 | 3689 |301 121 |18092 
54E-12 800 

GO:BP | neutrophil activation involved іп | GO:0002283 | 4.832566201605 |11.315822187833 |487 |301 38 |18092 
immune response 6E-12 30 

GO:BP | protein localization GO:0008104 | 6.657070962613 | 11.176716813559 |2789 |301 101 |18092 
01E-12 500 

GO:BP |protein metabolic process GO:0019538 | 8.367546871384 |11.077401846233 | 5806 |301 162 |18092 
52E-12 20 

СО:ВР | biological adhesion GO:0022610 | 1.021978093593 | 10.990558413333 | 1501 |301 69 |18092 
27E-11 900 

GO:BP | neutrophil activation GO:0042119 | 1.131886580586 | 10.946197088954 | 500 |301 38 |18092 
42Е-11 500 

СО:ВР | extracellular matrix organization | GO:0030198 | 1.229193141214 | 10.910379871732 |399 |301 34 |18092 
63E-11 40 

GO:BP | myeloid leukocyte mediated GO:0002444 | 1.279554991810 | 10.892941044574 | 556 |301 40 |18092 
immunity 83E-11 200 

GO:BP | extracellular structure GO:0043062 | 1.323801935803 | 10.878176988176 | 400 |301 34 |18092 
organization 41E-11 000 

GO:BP | negative regulation of biological | GO:0048519 | 1.324365576380 | 10.877992116188 | 5833 |301 162 |18092 
pr 83E-11 000 

GO:BP | protein folding GO:0006457 | 1.665220654693 | 10.778528210946 |224 |301 26 |18092 
76E-11 500 

GO:BP | granulocyte activation GO:0036230 | 1.769213286319 | 10.752219807854 |507 |301 38 |18092 
78E-11 500 

GO:BP | leukocyte mediated immunity | GO:0002443 | 1.792422155062 | 10.746559696670 |891 |301 51 |18092 
28E-11 500 

GO:BP | immune effector process GO:0002252 | 2.695461319810 | 10.569366895967 | 1310 |301 63 |18092 
72E-11 900 

GO:BP | cellular response to cytokine GO:0071345 — |4.618235910015 |10.335523886100 |1148 | 301 58 |18092 
stimulus 82E-11 300 

GO:BP | intracellular protein transport GO:0006886 | 7.211478089639 | 10.141975711649 | 1195 |301 59 |18092 
51E-11 00 

GO:BP |RNA catabolic process GO:0006401 | 7.860061519972. | 10.104574054764 | 225 [301 34 |18092 
74E-11 600 

GO:BP | cellular response to organic GO:0071310 | 8.993734311181 | 10.046059946309 |2812 |301 99 |18092 
substance 27E-M 100 

GO:BP | cell adhesion GO:0007155 | 9.310328428361 | 10.031034998706 |1494 | 301 67 |18092 
18E-11 20 

GO:BP | immune system process GO:0002376 | 1.100747594968 | 9.9583122547605 |3322 |301 110 |18092 
62E-10 60 

GO:BP | leukocyte degranulation GO:0043299 | 1.237531977043 | 9.9074435703548 |539 |301 38 |18092 
15E-10 30 

GO:BP | cell activation involved іп GO:0002263 | 1.940096108072 | 9.7121767555482 | 723 [301 44 |18092 
immune response 63E-10 50 

GO:BP | cellular component assembly GO:0022607 | 2.087046000680 | 9.6804679785235 | 3030 |301 103 |18092 
91E-10 70 

GO:BP | myeloid cell activation involved | GO:0002275 | 2.204470938968 | 9.6566956220038 | 549 |301 38 |18092 
іп immune response 15E-10 90 

GO:BP | response to cytokine GO:0034097 | 3.829600417983 | 9.4168465381248 | 1242 | 301 59 |18092 
12E-10 80 

GO:BP | organic substance catabolic GO:1901575 | 4.409948812174 |9.3555664515115 |2271 |301 85 | 18092 
process 95Е-10 20 

GO:BP | nucleobase-containing compound | GO:0034655 | 5.703685541864 | 9.2438444262132 | 566 |301 38 |18092 
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5Е-10 


80 
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GO:BP | platelet activation GO:0030168 | 6.247993957779 | 9.2042593991218 | 160 |301 21 |18092 
45Е-10 40 

GO:BP | leukocyte activation involved іп | GO:0002366 | 6.957131565641 | 9.1575697836638 | 719 |301 аз |18092 
immune response 33E-10 00 

СО:ВР | cellular protein metabolic process | GO:0044267 | 8.488219778395 | 9.0711833841363 | 5225 |301 146 |18092 
18E-10 40 

GO:BP | myeloid leukocyte activation GO:0002274 | 1.452428863017 | 8.8379051292683 |673 |301 41 |18092 
89Е-09 40 

GO:BP | сей death GO:0008219 | 1.516503910305 | 8.8191564655236 |2322 |301 8 |18092 
1E-09 2 

GO:BP | macromolecule catabolic process | GO:0009057 | 1.884391749686 | 8.7248288058736 | 1477 |301 64 [18092 
97E-09 60 

GO:BP GO:0009653 | 2.073891302399 | 8.6832140097623 | 2865 |301 9 |18092 
57Е-09 80 

GO:BP | programmed cell death GO:0012501 | 2.468038582218 | 8.6076480553706 |2172 | 301 81 | 18092 
8E-09 60 

GO:BP | negative regulation of apoptotic | GO:0043066 | 2.548224719217 | 8.5937622757037 | 944 |301 49 |18092 
process 218-09 90 

СО:ВР | regulation of protein stability GO:0031647 | 2.936105885173 | 8.5322282864544 |302 |301 27 |18092 
46Е-09 20 

СО:ВР | response to stimulus GO:0050896 | 3.125411680208 | 8.5050927691464 |9469 |301 217 |18092 
328-09 30 

СО:ВР | heterocycle catabolic process GO:0046700 | 3.603947713563 | 8.4432215184353 | 601 |301 38 |18092 
63E-09 5 

GO:BP | cellular nitrogen compound GO:0044270 |4.635360371116 | 8.3339164964486 |606 |301 38 |18092 
catabolic process 24E-09 50 

GO:BP | platelet aggregation GO:0070527 | 4.973479496395 | 8.3033396681355 |64 |301 14 |18092 
68Е-09 60 

GO:BP | negative regulation of GO:0043069 | 6.138459404772 | 8.2119406119140 |967 |301 49 |18092 
programmed cell death 33E-09 70 

GO:BP | catabolic process GO:0009056 | 7.627026534980 | 8.1176447426665 | 2746 |301 93 |18092 
37E-09 50 

GO:BP | organic cyclic compound GO:1901361 | 9.012674259590 | 8.0451463251300 |650 |301 39 [18092 
catabolic process 35Е-09 10 

GO:BP ic compound catabolic | GO:0019439 | 1.018297158391 |7.9921254681509 | 622 |301 38 |18092 
35E-08 80 

GO:BP | regulation of localization GO:0032879 | 1.173282629521 | 7.9305973590211 | 2903 |301 96 |18092 
6Е-08 80 

СО:ВР | negative regulation of GO:0050819 | 1.370347344619 | 7.8631693374187 |56 |301 13 |18092 
coagulation 96E-08 40 

GO:BP | regulation of proteolysis GO:0030162 | 1.379812224587 | 7.8601800116958 |754 |301 42 |18092 
94Е-08 70 

GO:BP | movement of cell ог subcellular | GO:0006928 | 1.581668481193 | 7.8008845397234 | 2337 |301 вз |18092 
component 22E-08 20 

GO:BP | negative regulation of cell death | GO:0060548 | 1.749815288132 | 7.7570077933585 | 1066 |301 51 |18092 
8E-08 80 

GO:BP | protein stabilization GO:0050821 | 2.042813067671 | 7.6897713726690 | 191 |301 21 |18092 
67E-08 80 

GO:BP charide biosynthetic GO:0046364 | 2.110441957922 | 7.6756265874567 | 100 |301 16 |18092 
8E-08 3 

GO:BP | endocytosis GO:0006897 | 2.220353735210 | 7.6535778304988 | 701 |301 40 |18092 
85Е-08 7 

GO:BP | regulation of peptidase activity | GO:0052547 | 3.342820708790 | 7.4758869160401 |469 |301 32 |18092 
07E-08 70 

GO:BP | regulation of apoptotic process | GO:0042981 |3.530777440211 | 7.4521296569863 | 1579 |301 64 [18092 
64Е-08 90 

GO:BP | leukocyte activation GO:0045321 |3.544782941471 | 7.4504103529230 | 1346 |301 58 |18092 
22E-08 60 
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GO:BP | homotypic cell-cell adhesion GO:0034109 | 3.709955980883 | 7.4306312433155 | 88 301 15 |18092 
86Е-08 90 
СО:ВР | cellular macromolecule catabolic | GO:0044265 | 4.004581499173 | 7.3974428633740 |1237 |301 55 |18092 
ргосевв 748-08 70 
GO:BP e regulation of biological | GO:0048518 | 4.207948123314 | 7.3759296229173 | 6466 | 301 164 |18092 
ргосевв 28E-08 50 
GO:BP | regulation of coagulation GO:0050818 | 4.945208054177 | 7.3058154320990 | 75 301 14 |18092 
79Е-08 40 
GO:BP | regulation of cell death GO:0010941 | 5.127826953607 | 7.2900666391759 | 1754 |301 68 |18092 
23E-08 7 
GO:BP | cell-substrate adhesion GO:0031589 | 5.397856646955 | 7.2677786533591 |368 |301 28 |18092 
09Е-08 70 
GO:BP | negative regulation of hemostasis | GO:1900047 | 5.941738545587 | 7.2260864623786 |50 |301 12 |18092 
86Е-08 50 
GO:BP | negative regulation of peptidase | GO:0010466 | 6.720176727683 | 7.1726193056889 |271 |301 24 |18092 
activity 4E-08 60 
GO:BP | translation GO:0006412 | 7.790345230100 |7.1084432960879 | 763 |301 4 |18092 
24E-08 80 
GO:BP | substrate adhesion-dependent cell | GO:0034446 | 8.177416516734 | 7.0873838808938 | 109 |301 16 | 18092 
spreading 37E-08 00 
GO:BP | regulation of programmed cell | GO:0043067 | 9.018631364465 | 7.0448593644491 |1614 | 301 64 [18092 
death 87E-08 10 
GO:BP | viral life cycle GO:0019058 | 9.748717884832 | 7.0110524973431 |351 |301 27 |18092 
29E-08 1 
GO:BP tive regulation of metabolic | GO:0009892 | 1.082136167437 | 6.9657180876157 | 3539 |301 107 |18092 
] 758-07 50 
СО:ВР | negative regulation of GO:0010951 | 1.544357953845 | 6.8112520308381 |258 |301 23 |18092 
2Е-07 90 
GO:BP | cellular developmental process | GO:0048869 | 1.787780923390 | 6.7476857011954 |4518 |301 126 |18092 
86Е-07 30 
GO:BP | regulation of hemostasis GO:1900046 | 1.902525330300 | 6.7206695523044 | 68 301 1 |18092 
02E-07 10 
GO:BP | peptide biosynthetic process GO:0043043 | 2.099804426579 | 6.6778211530816 |788 |301 41 |18092 
44E-07 40 
GO:BP | immune response GO:0006955 | 2.187701092276 | 6.6600120163618 | 2326 |301 80 [18092 
52E-07 30 
GO:BP | localization of cell GO:0051674 | 2.292578716204 | 6.6396757438034 | 1856 |301 69 |18092 
62E-07 600 
GO:BP | cell motility GO:0048870 | 2.292578716204 | 6.6396757438034 | 1856 | 301 69 |18092 
62E-07 600 
GO:BP | negative regulation of GO:0010605 | 2.457999135004 | 6.6094182742823 | 3297 |301 101 |18092 
macromolecule metabolic 26E-07 300 
process 
GO:BP | negative regulation of proteolysis | GO:0045861 | 2.520048223097 | 6.5985911485736 | 366 |301 27 | 18092 
92Е-07 40 
GO:BP | peptide metabolic process GO:0006518 | 3.639216685945 | 6.4389920849170 | 941 |301 45 |18092 
18E-07 30 
GO:BP | supramolecular fiber organization | GO:0097435 | 4.205075134299 | 6.3762262400277 | 739 |301 зә |18092 
04E-07 900 
GO:BP |NADH metabolic process GO:0006734 | 4.264935313718 | 6.3700875513916 |46 |301 11 |18092 
54Е-07 80 
GO:BP | apoptotic process GO:0006915 | 5.153603918010 | 6.2878889623533 |2018 |301 72 |18092 
51Е-07 
СО:ВР | response to inorganic substance | GO:0010035 | 5.671206417519 | 6.2463245452094 | 585 |301 34 
66Е-07 50 
GO:BP | сей differentiation GO:0030154 | 5.762512808304 | 6.2393880962973 | 4440 |301 123 |18092 
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GO:BP | anatomical structure development | GO:0048856 | 5.892954998428 | 6.2296668753835 | 6210 |301 156 |18092 
28E-07 40 

GO:BP | regulation of endopeptidase GO:0052548 | 6.176028183869 | 6.2092907307955 | 437 |301 29 |18092 
activity 51E-07 300 

GO:BP | positive regulation of cellular | GO:0051130 | 7.200232868430 | 6.1426534574800 |1182 |301 51 (18092 
component organization 06E-07 20 

GO:BP | positive regulation of transport | 60:0051050 | 8.390828339042 | 6.0761951636864 | 930 |301 44 |18092 
26E-07 70 

GO:BP | negative regulation of blood GO:0030195 | 8.910505478395 | 6.0500976584489 |49 |301 11 [18092 
coagulation 39E-07 8 

GO:BP | negative regulation of protein | GO:0051248 | 0.000001019233 | 5.9917264227373 |1156 |301 50 | 18092 
metabolic process 2364856900 50 

GO:BP | regulation of protein localization | GO:0032880 | 0.000001027018 | 5.9884215668718 | 936 |301 44 | 18092 
893456050 20 

GO:BP | blood coagulation, fibrin clot GO:0072378 | 0.000001080163 | 5.9665105178135 | 28 |301 9 (18092 
formation 4609929200 20 

GO:BP | protein activation cascade GO:0072376 | 0.000001080163 | 5.9665105178135 |28 |301 9 [18092 
4609929200 20 

GO:BP | glycolytic process through GO:0061620 | 0.000001080163 | 5.9665105178135 |28 |301 9 (18092 
glucose-6-phosphate 4609929200 20 

GO:BP | regulation of protein metabolic | GO:0051246 | 0.000001091348 | 5.9620366586972 |2722 |301 87 | 18092 
process 2121092900 70 

GO:BP | locomotion GO:0040011 | 0.000001161998 | 5.9347944055448 |2055 |301 72 |18092 
5723008700 60 

GO:BP | cytoskeleton organization GO:0007010 | 0.000001264949 | 5.8979267318631 |1435 |301 57 | 18092 
734232240 00 

GO:BP | cell migration GO:0016477 | 0.000001451560 | 5.8381647653142 | 1684 |301 63 | 18092 
8110332900 30 

GO:BP | anatomical structure homeostasis | GO:0060249 | 0.000001461329 | 5.8352518805137 | 483 |301 30 | 18092 
392196080 50 

GO:BP | mRNA metabolic process GO:0016071 | 0.000001541246 | 5.8121278510724 | 877 |301 42 | 18092 
661676130 20 

GO:BP | glycolytic process through GO:0061615 | 0.000001543519 | 5.8114880030769 | 29 |301 9 [18092 
fructose-6-phosphate 0608075600 70 

GO:BP | regulation of blood coagulation | GO:0030193 | 0.000002280838 15.6419054654725 |67 |301 12 |18092 
496737500 50 

GO:BP | negative regulation of catalytic | GO:0043086 | 0.000002429370 | 5.6145062011262 |855 |301 41 | 18092 
activity 7537088300 50 

GO:BP | amide biosynthetic process GO:0043604 | 0.000002584138 | 5.5876842624994 | 928 |301 аз | 18092 
208561930 60 

GO:BP | membrane organization GO:0061024 | 0.000002923483 | 5.5340993972429 | 968 |301 44 | 18092 
2018855500 40 

GO:BP | developmental process GO:0032502 | 0.000003577603 | 5.4464077916136 | 6745 | 301 163 | 18092 
5135248000 80 

GO:BP | cytoplasmic translation GO:0002181 | 0.000003998460 | 5.3981071608408 | 103 |301 14 (18092 
767862170 10 

GO:BP | regulation of cellular protein GO:0032268 | 0.000004082611 | 5.3890619665359 | 2563 |301 82 | 18092 
metabolic process 301632350 70 

GO:BP | positive regulation of cellular | GO:0048522 | 0.000004362968 | 5.3602178745473 | 5863 |301 147 | 18092 
pro 982190420 10 

GO:BP | positive regulation of cellular |GO:1903829 | 0.000004490078 | 5.3477460346632 |306 |301 23 |18092 
protein localization 825679300 60 

GO:BP |receptor-mediated endocytosis | GO:0006898 | 0.000004568015 | 5.3402724545035 |333 |301 24 |18092 
254149030 2 

GO:BP | response to external stimulus GO:0009605 | 0.000004774034 | 5.3211144249904 |2991 |301 91 (18092 
740281340 30 

GO:BP | post-translational protein GO:0043687 | 0.000004995227 | 5.3014447643086 | 362 |301 25 |18092 
modification 079037370 10 
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GO:BP | plasminogen activation GO:0031639 | 0.000005027880 | 5.2986150357169 | 23 301 8 18092 
699445840 900 

GO:BP | protein refolding GO:0042026 | 0.000005027880 | 5.2986150357169 | 23 301 8 18092 
699445840 900 

GO:BP | cellular catabolic process GO:0044248 | 0.000005315046 | 5.2744929631691 | 2350 |301 77 |18092 
113539450 200 

GO:BP | response 10 topologically GO:0035966 | 0.000005335772 | 5.2728026681546 |209 |301 19 |18092 
incorrect protein 83722167 90 

GO:BP substance metabolic GO:0071704 | 0.000005486392 | 5.2607131346951 | 11403 | 301 237 | 18092 
3891106600 40 

СО:ВР | response to unfolded protein GO:0006986 | 0.000005715082 | 5.2429775295656 | 187 [301 18 |18092 
057912700 70 

GO:BP | regulation of cellular protein GO:1903827 | 0.000007140423 | 5.1462760449724 | 580 |301 32 |18092 
localization 243376140 30 

GO:BP | regulation of biological process | GO:0050789 | 0.000008884983 | 5.0513433545114 | 12145 |301 247 |18092 
906303410 6 

GO:BP | organelle organization GO:0006996 | 0.000009229851 | 5.0348052665390 | 4115 |301 113 |18092 
920502390 50 

GO:BP | regulation of complement GO:0030449 | 0.000009617859 | 5.0169215777593 | 110 |301 14 |18092 
activation 361245670 50 

GO:BP | regulation of catalytic activity | GO:0050790 | 0.000009902679 | 5.0042472858288 |2428 |301 78 |18092 
290792230 10 

GO:BP |interleukin-12-mediated signaling | GO:0035722 | 0.000010015551 | 4.9993251429165 | 47 301 10 |18092 
pathway 238132200 00 

GO:BP | regulation of cellular component | GO:0051128 | 0.000010472206 |4.9799617925123 |2431 |301 78 |18092 
organization 742755900 30 

GO:BP | fibrinolysis GO:0042730 | 0.000010774418 | 4.9676061604170 | 25 301 8 18092 
479598400 50 

GO:BP | protein-containing complex GO:0065003 | 0.000011257726 | 4.9485492994350 | 1689 |301 61 |18092 
assembly 682688300 30 

GO:BP | response to abiotic stimulus GO:0009628 | 0.000011606711 | 4.9352907943126 | 1282 |301 51 |18092 
929473000 70 

СО:ВР | biological process involved іп | GO:0051702 | 0.000014165981 | 4.8487533232442 | 95 301 1 |18092 
interaction with symbiont 710648700 10 

GO:BP | cellular response to interleukin- | GO:0071349 | 0.000015434774 |4.8114996974723 |49 |301 10 |18092 
12 98503850 600 

СО:ВР | cell morphogenesis involved іп | GO:0000904 | 0.000015822553 | 4.8007234327566 | 769 |301 37 |18092 
differentiation 297301300 00 

GO:BP | system development GO:0048731 | 0.000016676171 | 4.7779036478042 | 5151 |301 132 |18092 
476017000 50 

GO:BP | cell morphogenesis GO:0000902 | 0.000016752533 | 4.7759195049338 | 1063 |301 45 |18092 
49999010 30 

GO:BP | metabolic process GO:0008152 | 0.000018834872 | 4.7250373093875 | 11998 | 301 244 |18092 
75839680 90 

GO:BP | response to interleukin-12 GO:0070671 | 0.000019012065 |4.7209707028477 |50 |301 10 |18092 
294426500 00 

GO:BP | cellular amide metabolic process | GO:0043603 | 0.000019487188 | 4.7102508195351 |1223 |301 4 |18092 
246454700 10 

GO:BP | proteolysis GO:0006508 | 0.000021227589 | 4.6730993110623 | 1846 |301 64 [18092 
905868700 70 

GO:BP | glucose catabolic process to GO:0061718 | 0.000021484979 | 4.6678650555830 | 27 301 8 18092 
pyruvate 53864230 10 

СО:ВР | NADH regeneration GO:0006735 | 0.000021484979 | 4.6678650555830 | 27 301 8 18092 
53864230 10 

GO:BP | canonical glycolysis GO:0061621 | 0.000021484979 | 4.6678650555830 | 27 301 8 18092 
53864230 10 

GO:BP | biological regulation GO:0065007 | 0.000021800972 | 4.6615241424679 | 12801 | 301 255 | 18092 
02024860 10 
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GO:BP | animal organ development GO:0048513 | 0.000021973042 | 4.6581097953188 |3770 |301 105 |18092 
953267500 40 
GO:BP | monosaccharide metabolic GO:0005996 | 0.000025683785 | 4.5903409582355 |281 |301 21 |18092 
pro 946311200 00 
GO:BP |NAD metabolic process GO:0019674 | 0.000028432122 | 4.5461907253148 |52 |301 10 |18092 
045595300 40 
GO:BP | response to oxygen-containing | GO:1901700 | 0.000028499171 | 4.5451677650908 |1774 |301 62 |18092 
compound 506625600 10 
GO:BP | regulation of hydrolase activity | GO:0051336 | 0.000029046048 | 4.5369129416270 |1318 |301 51 (18092 
511983800 80 
GO:BP | regulation of cell morphogenesis | GO:0022604 | 0.000032841495 | 4.4835770791854 |312 |301 22 |18092 
15903640 3 
GO:BP | negative regulation of cellular | GO:0048523 | 0.000034799454 | 4.4584275673561 | 5208 |301 132 |18092 
pro 214857200 40 
GO:BP | primary metabolic process GO:0044238 | 0.000041638376 | 4.3805062106083 | 10575 | 301 222 |18092 
65056050 50 
GO:BP | cell junction organization GO:0034330 | 0.000045934537 | 4.3378606552019 | 730 |301 35 [18092 
16529250 20 
GO:BP | positive regulation of GO:1904951 | 0.000046049910 | 4.3367712095251 |346 |301 23 |18092 
establishment of protein 501205500 200 
localization 
GO:BP | protein-containing complex GO:0043933 | 0.000048736099 | 4.3121492319005 | 1974 |301 66 |18092 
subunit organization 47572140 00 
GO:BP | carbohydrate biosynthetic GO:0016051 | 0.000059905680 | 4.2225319905599 |217 |301 18 |18092 
pro 987057700 20 
GO:BP | blood vessel development GO:0001568 | 0.000061712955 | 4.2096236521933 | 810 |301 37 | 18092 
79316560 00 
GO:BP e regulation of hydrolase | GO:0051346 | 0.000071176915 | 4.1476608386883 |475 |301 27 |18092 
activity 12019530 00 
GO:BP | hexose catabolic process GO:0019320 | 0.000072114703 |4.1419761751608 |57 |301 10 [18092 
93009720 10 
GO:BP | regulation of molecular function |GO:0065009 | 0.000077572681 | 4.1102911931214 |2969 |301 87 18092 
97599670 900 
GO:BP | response to endogenous stimulus | GO:0009719 | 0.000086260346 | 4.0641888021066 | 1698 |301 59 | 18092 
41227820 10 
GO:BP | regulation of humoral immune | GO:0002920 | 0.000093732855 | 4.0281081515367 | 131 |301 14 | 18092 
response 6694432 40 
GO:BP | regulation of establishment of | GO:0070201 | 0.000115080721 | 3.9389974234913 | 618 |301 31 | 18092 
protein localization 62379900 800 
GO:BP | negative regulation of wound | 60:0061045 | 0.000117561283 | 3.9297356825207 | 76 |301 11 | 18092 
healing 10791800 600 
GO:BP | hexose biosynthetic process GO:0019319 | 0.000121731083 | 3.9145985119870 |94 |301 12 | 18092 
61908600 10 
GO:BP | positive regulation of organelle | GO:0010638 | 0.000123783502 | 3.9073372320448 | 620 |301 31 | 18092 
organization 7569280 70 
GO:BP | regulation of wound healing GO:0061041 | 0.000125145830 | 3.9025836137738 | 134 |301 14 | 18092 
92845700 900 
GO:BP | homeostatic process GO:0042592 | 0.000151222446 | 3.8203837406493 |2033 |301 66 |18092 
32057800 20 
GO:BP | multicellular organismal pr GO:0032501 | 0.000151256558 | 3.8202857838759 | 8065 |301 180 | 18092 
96590000 200 
GO:BP | multicellular organism GO:0007275 | 0.000167498242 | 3.7759897452775 | 5718 |301 139 | 18092 
development 58659400 80 
GO:BP | nitrogen compound metabolic | GO:0006807 | 0.000168181346 | 3.7742221746579 | 10057 |301 212 | 18092 
process 54604400 80 
GO:BP | cell population proliferation GO:0008283 | 0.000192689666 | 3.7151415751081 | 2091 |301 67 | 18092 
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GO:BP | vasculature development GO:0001944 | 0.000193345341 | 3.7136662876658 | 847 |301 37 | 18092 
399463 100 

GO:BP charide catabolic GO:0046365 | 0.000195438174 | 3.7089906031960 | 63 301 10 |18092 
21598800 30 

GO:BP | anatomical structure formation | GO:0048646 | 0.000196181186 | 3.7073426432580 | 1235 |301 47 |18092 
involved іп morphogenesis 46905500 000 

GO:BP | cellular metabolic process GO:0044237 | 0.000200873042 | 3.6970783429544 | 10855 | 301 224 |18092 
23931300 600 

GO:BP | peptide cross-linking GO:0018149 | 0.000202886535 | 3.6927467745137 | 35 301 8 18092 
17806800 500 

GO:BP | epithelium development GO:0060429 | 0.000213594293 | 3.6704103549314 | 1361 |301 50 |18092 
20584800 90 

СО:ВР | positive regulation of cellular | GO:0032270 | 0.000233518557 | 3.6316786015068 |1490 |301 5з |18092 
protein metabolic process 0944970 30 

GO:BP | glucose catabolic process GO:0006007 | 0.000257111435 | 3.5898786065270 |36 [301 8 18092 
7139800 00 

GO:BP | actin filament-based process GO:0030029 | 0.000293343558 | 3.5326234437385 | 824 |301 36 |18092 
655817 900 

GO:BP | cell development GO:0048468 | 0.000300312777 | 3.5224261900440 |2161 | 301 68 |18092 
01811400 500 

GO:BP | humoral immune response GO:0006959 | 0.000309046387 | 3.5099763292562 | зва |301 23 |18092 
026255 10 

GO:BP | cell-cell adhesion GO:0098609 | 0.000313148072 | 3.5042502565308 | 901 |301 38 | 18092 
9385730 300 

GO:BP | regulation of response to GO:1903034 | 0.000315426069 | 3.5011024158672 | 167 |301 15 |18092 
wounding 3963440 80 

GO:BP | organonitrogen compound GO:1901566 | 0.000341245320 | 3.4669332959498 | 1854 | 301 61 |18092 
biosynthetic process 06351300 100 

GO:BP | negative regulation of cellular | GO:0032269 | 0.000352980905 | 3.4522487874056 | 1099 | 301 аз |18092 
protein metabolic process 2793530 80 

GO:BP | actin cytoskeleton organization | GO:0030036 | 0.000379819173 | 3.4204231156599 | 723 | 301 зз | 18092 
44699900 400 

GO:BP | regulation of developmental GO:0050793 | 0.000431902753 | 3.3646140266984 | 2648 | 301 78 | 18092 
process 9945540 5 

GO:BP | regulation of metabolic process | GO:0019222 | 0.000491690551 | 3.3083081367993 |7174 |301 163 |18092 
9382150 700 

GO:BP | regulation of transport GO:0051049 | 0.000500216683 | 3.3008418273699 | 1830 |301 60 |18092 
6926320 800 

СО:ВР | regulation of cell-substrate GO:0010810 | 0.000500465635 | 3.3006257379872 |223 |301 17 |18092 
adhesion 5319780 800 

СО:ВР |'de novo protein folding GO:0006458 | 0.000500572568 | 3.3005329539385 |39 |301 8 18092 
0178950 200 

GO:BP | leukocyte migration GO:0050900 | 0.000501652673 | 3.2995968686654 |523 |301 27 |18092 
7583740 400 

GO:BP | establishment or maintenance of | GO:0007163 | 0.000645038861 | 3.1904141195662 |227 |301 17 |18092 
cell polarity 7611290 90 

GO:BP | negative regulation of fibrinolysis | GO:0051918 | 0.000691024043 | 3.1605068413138 |10 |301 5 18092 
8197370 300 

СО:ВР | negative regulation of response to | GO:1903035 | 0.000778688927 | 3.1086360007914 | 91 301 11 |18092 
wounding 2873790 300 

GO:BP | positive regulation of protein GO:1900182 | 0.000778688927 | 3.1086360007914 | 91 301 11 |18092 
localization to nucleus 2873790 300 

GO:BP | hexose metabolic pr GO:0019318 | 0.000822820252 | 3.0846950272482 |258 |301 18 |18092 
6572610 90 

GO:BP | complement activation GO:0006956 | 0.000839429273 | 3.0760158894144 | 180 |301 15 |18092 
9863420 40 

GO:BP | chaperone cofactor-dependent | GO:0051085 | 0.000893344587 | 3.0489809896085 |29 |301 7 18092 
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GO:BP | circulatory system development | GO:0072359 | 0.000909475943 | 3.0412087841121 |1219 |301 45 | 18092 
1581390 50 

GO:BP | zymogen activation GO:0031638 | 0.000921601433 | 3.0354568583651 |57 |301 9 (18092 
5725130 500 

GO:BP | regulation of cell adhesion GO:0030155 | 0.000929784482 | 3.0316177063711 | 752 |301 зз | 18092 
1030280 100 

GO:BP | ribosomal large subunit GO:0042273 | 0.000934883133 | 3.0292426754203 |74 |301 10 [18092 
biogenesis 4181520 300 

GO:BP | integrin-mediated signaling GO:0007229 | 0.000950893190 | 3.0218682625801 |113 |301 12 | 18092 
pathway 6390350 30 

GO:BP | diterpenoid metabolic process | GO:0016101 | 0.000950893190 | 3.0218682625801 |113 |301 12 | 18092 
6390350 30 

GO:BP | cell-matrix adhesion GO:0007160 | 0.001118614644 | 2.9513194993602 | 236 |301 17 | 18092 
457080 900 

GO:BP | chaperone-mediated protein GO:0061077 | 0.001250282554 | 2.9029918287313 |59 |301 9 [18092 
folding 114570 00 

GO:BP | cell junction assembly GO:0034329 | 0.001314412222 | 2.8812684111758 | 449 |301 24 |18092 
337920 300 

GO:BP | viral entry into host cell GO:0046718 | 0.001374811088 | 2.8617569738165 | 139 |301 1з | 18092 
2287500 00 

GO:BP | positive regulation of protein GO:0051247 | 0.001427136507 | 2.8455344841225 |1578 |301 53 | 18092 
metabolic process 2787500 600 

GO:BP | positive regulation of ion GO:0043270 | 0.001564203928 | 2.8057066276746 | 696 |301 31 | 18092 
transport 6472000 90 

GO:BP | regulation of response to stress | GO:0080134 | 0.001614786347 | 2.7918849309679 | 1455 |301 50 | 18092 
8667500 400 

GO:BP | substantia nigra development | GO:0021762 | 0.001918955040 | 2.7169352003075 |46 |301 8 | 18092 
462870 800 

GO:BP | regulation of multicellular GO:0051239 | 0.001972292203 | 2.7050287420836 | 2894 |301 81 [18092 
organismal process 3409600 70 

GO:BP | skin development GO:0043588 | 0.002254307715 | 2.6469868025462 | 430 |301 23 | 18092 
600970 900 

GO:BP | positive regulation of cell GO:2000147 | 0.002262503291 |2.6454107802348 | 600 |301 28 | 18092 
motility 8850100 200 

GO:BP | response to nitrogen compound | GO:1901698 | 0.002303533415 | 2.6376054834108 |1178 |301 аз | 18092 
4749100 200 

GO:BP | modulation by host of viral GO:0044788 | 0.002309594681 | 2.6364642292169 |33 |301 7 (18092 
process 4788600 800 

GO:BP | terpenoid metabolic process GO:0006721 | 0.002387248464 | 2.6221023773732 | 123 |301 12 | 18092 
258790 600 

GO:BP | response to organic cyclic GO:0014070 | 0.002403027150 | 2.6192413222508 | 978 |301 38 | 18092 
compound 8309800 400 

GO:BP | positive regulation of endocytosis | GO:0045807 | 0.002750076512 | 2.5606552231342 |103 |301 11 | 18092 
1624100 200 

GO:BP | response to drug GO:0042493 | 0.002792863521 | 2.5539502863568 |403 |301 22 | 18092 
730270 70 

GO:BP e phosphate metabolic GO:0019693 | 0.002858038568 | 2.5439319148730 |436 |301 23 |18092 
ess 1011900 70 

GO:BP | regulation of cellular localization | GO:0060341 | 0.002896952939 | 2.5380585597864 |868 |301 35 [18092 
442360 00 

СО:ВР | response to oxidative stress GO:0006979 | 0.003432362312 | 2.4644068753100 |474 |301 24 |18092 
0101800 20 

GO:BP | positive regulation of cellular | GO:0051272 | 0.003442280017 | 2.4631538042266 | 613 |301 28 |18092 
component movement 501070 100 

GO:BP | blood vessel morphogenesis GO:0048514 | 0.003490601994 | 2.4570996675359 | 723 |301 31 [18092 
1968300 10 

GO:BP |positive regulation of cell GO:0030335 | 0.003503704362 | 2.4554725460743 | 578 |301 27 |18092 
migration 6863600 400 
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СО:ВР | regulation of cellular component | GO:0051270 | 0.003530639847 | 2.4521465815402 |1156 |301 42 |18092 
movement 7002700 10 

GO:BP |'de novo posttranslational protein | GO:0051084 | 0.003533291395 | 2.4518205441950 | 35 301 7 18092 
folding 644010 300 

СО:ВР | positive regulation of locomotion | GO:0040017 | 0.003667497424 | 2.4356301824254 |615 |301 28 |18092 
8578900 800 

GO:BP | regulation of cell motility GO:2000145 | 0.003677650670 | 2.4344295255328 | 1076 |301 40 |18092 
4535100 300 

GO:BP | entry into host GO:0044409 | 0.003814249296 | 2.4185909252332 | 152 |301 1 |18092 
1878000 500 

GO:BP | positive regulation of cell GO:0008284 | 0.003818451085 | 2.4181127683473 |997 |301 38 |18092 
population proliferation 5596200 900 

GO:BP | retinoid metabolic process GO:0001523 | 0.004027894528 | 2.3949219100854 | 107 |301 11 |18092 
516400 100 

GO:BP | pyruvate metabolic process GO:0006090 | 0.004419745982 | 2.3546026903257 | 154 |301 13 |18092 
052770 800 

GO:BP | small molecule biosynthetic GO:0044283 | 0.004962050448 | 2.3043388246606 |698 |301 30 |18092 
ргосевв 123570 80 

GO:BP | cytokine-mediated signaling GO:0019221 | 0.005052199205 | 2.2965195337515 | 812 |301 зз |18092 
pathway 84263 90 

GO:BP | regulation of fibrinolysis GO:0051917 | 0.005197053858 | 2.2842427824062 |14 |301 5 18092 
9106700 100 

GO:BP | reactive oxygen species GO:0072593 | 0.005440556048 | 2.2643567112502 |294 |301 18 |18092 
metabolic process 468030 900 

GO:BP | regulation of protein transport | GO:0051223 | 0.005503728235 | 2.2593430189837 | 592 |301 27 |18092 
035130 600 

СО:ВР | response to organonitrogen GO:0010243 | 0.005535314095 | 2.2568577304991 | 1094 |301 40 |18092 
compound 618620 400 

GO:BP | regulation of ion transport GO:0043269 | 0.005865112176 |2.2317236771323 | 1391 |301 47 |18092 
41096 600 

GO:BP | tube morphoge GO:0035239 | 0.005872499117 |2.2311770398001 |975 |301 37 | 18092 
696920 500 

GO:BP | apoptotic cell clearance GO:0043277 | 0.005892326971 | 2.2297131616933 | 53 301 8 18092 
554410 20 

GO:BP | regulation of cell migration GO:0030334 | 0.005980216404 | 2.2232831000130 | 1016 | 301 38 | 18092 
711020 40 

СО:ВР | RNA-dependent DNA GO:0006278 | 0.006222962310 | 2.2060028293464 | 71 301 9 18092 
biosynthetic process 137120 10 

GO:BP | regulation of organelle GO:0033043 | 0.006238986731 | 2.2048859380042 | 1224 |301 аз |18092 
organization 238450 00 

GO:BP | small molecule metabolic process | GO:0044281 | 0.006347776501 | 2.1973783727156 |1929 | 301 59 |18092 
236580 20 

СО:ВР | negative regulation of molecular | GO:0044092 | 0.006505517735 | 2.1867181347781 | 1226 | 301 аз |18092 
function 9738200 800 

GO:BP | purine nucleoside triphosphate | GO:0009144 | 0.006519836661 | 2.1857632842844 | 91 301 10 |18092 
metabolic process 940410 200 

GO:BP | gluconeogenesis GO:0006094 | 0.006519836661 | 2.1857632842844 | 91 301 10 |18092 
940410 200 

GO:BP | regulation of anatomical structure | GO:0022603 | 0.006762765784 | 2.1698756532561 |1062 |301 39 |18092 
morphogenesis 803870 10 

GO:BP | cellular detoxification GO:1990748 | 0.006918964303 | 2.1599589100344 |113 |301 11 |18092 
175410 5 

GO:BP | positive regulation of protein GO:0051222 | 0.007031447559 | 2.1529552578628 |330 |301 19 |18092 
transport 37996 400 

СО:ВР | positive regulation of molecular | GO:0044093 | 0.007043075689 | 2.1522376443800 | 1844 | 301 57 |18092 
function 336240 10 

GO:BP | ribonucleoside triphosphate GO:0009199 | 0.007208658328 | 2.1421455583841 | 92 301 10 |18092 


metabolic process 


898660 


10 
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GO:BP | organelle assembly GO:0070925 |0.007228238311 |2.1409675373729 | 904 |301 35 [18092 
906370 
GO:BP | carbohydrate derivative GO:1901135 | 0.007321082488 | 2.1354246998399 | 1148 |301 41 |18092 
metabolic process 038410 10 
GO:BP | nucleoside triphosphate GO:0009141 | 0.007546050439 | 2.1222802962164 | 114 |301 11 |18092 
metabolic proi 269590 900 
GO:BP | positive regulation by host of | 60:0044794 | 0.007689603628 | 2.1140960459191 |15 — |301 5 (18092 
viral proc 961480 600 
GO:BP | regulation of protein localization | GO:1904814 | 0.007689603628 | 2.1140960459191 |15 — |301 5 (18092 
to chromosome, telomeric region 961480 600 
GO:BP | carboxylic acid metabolic process | GO:0019752 | 0.007704197718 | 2.1132725801361 | 1027 |301 38 |18092 
2379600 400 
GO:BP | regulation of anion transport GO:0044070 | 0.007971770140 | 2.0984452323001 |908 |301 35 [18092 
840450 80 
GO:BP | carbohydrate metabolic process | GO:0005975 | 0.008132232244 | 2.0897902270240 | 641 |301 28 |18092 
989790 100 
GO:BP | positive regulation of NF-kappaB | GO:0051092 | 0.008336095250 | 2.0790373316289 |163 |301 1з |18092 
transcription factor activity 882390 700 
GO:BP | glucose metabolic process GO:0006006 | 0.008380425411 |2.0767339349035 |216 |301 15 |18092 
9814 300 
GO:BP | spindle organization GO:0007051 | 0.008400303457 | 2.0757050249460 |189 |301 14 (18092 
491010 500 
GO:BP | positive regulation of response to | GO:1903036 | 0.008841186423 | 2.0534894518777 |74 |301 9 [18092 
wounding 6824 60 
GO:BP | regulation of substrate adhesion- | GO:1900024 | 0.009043988559 | 2.0436399956837 |56 |301 8 (18092 
dependent cell spreading 876810 70 
GO:BP | collagen fibril organization GO:0030199 | 0.009043988559 | 2.0436399956837 |56 |301 8 (18092 
876810 70 
GO:BP | regulation of transforming GO:0071634 | 0.009122524315 | 2.0398849703702 |40 |301 7 (18092 
growth factor beta production 658750 90 
GO:BP | regulation of locomotion GO:0040012 | 0.009380783319 | 2.0277608953996 | 1118 |301 40 |18092 
553040 500 
GO:BP | biological process involved in | GO:0051701 | 0.009384390265 | 2.0275939396555 | 218 |301 15 | 18092 
interaction with host 544670 100 
GO:BP | ribosome biogenesis GO:0042254 | 0.010450542647 | 1.9808611581177 | 308 |301 18 | 18092 
043700 100 
GO:BP | establishment of protein GO:0070199 | 0.010593519529 | 1.9749597284580 |27 |301 6 (18092 
localization to chromosome 0166 700 
GO:BP | regulation of cell morphogenesis | GO:0010769 | 0.010635715728 | 1.9732332790096 |96 |301 10 | 18092 
involved in differentiation 557100 00 
GO:BP | protein targeting to lysosome GO:0061740 | 0.010857302265 | 1.9642780713239 | 3 301 3 | 18092 
involved in chaperone-mediated 331800 900 
autophagy 
GO:BP | hexitol metabolic process GO:0006059 | 0.010857302265 | 1.9642780713239 | 3 301 3 (18092 
331800 900 
GO:BP | positive regulation of anion GO:1903793 | 0.010953999021 | 1.9604273022011 | 507 |301 24 |18092 
transport 52080 100 
GO:BP | chaperone-mediated autophagy | GO:0061684 | 0.011032879674 | 1.9573111182289 |16 |301 5 | 18092 
9622 400 
GO:BP | regulation of macromolecule GO:0060255 | 0.011652552980 | 1.9335789137848 | 6626 |301 148 | 18092 
metabolic process 645200 20 
GO:BP | isoprenoid metabolic process GO:0006720 | 0.011731479928 | 1.9306471980928 | 143 |301 12 (18092 
536400 800 
GO:BP | cellular response to toxic GO:0097237 | 0.012461947829 | 1.9044140711941 | 120 |301 11 | 18092 
substance 696200 200 
GO:BP | DNA biosynthetic process GO:0071897 | 0.012826194237 | 1.8919021874823 | 196 |301 14 | 18092 
847000 700 
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GO:BP | transforming growth factor beta | GO:0071604 | 0.012830311479 | 1.8917628002114 |42 |301 7 | 18092 
production 064400 400 
GO:BP | tube development GO:0035295 | 0.013762388616 | 1.8613061829063 |1178 |301 41 | 18092 
073500 200 
GO:BP | regulation of peptide transport | GO:0090087 | 0.014155243223 | 1.8490826639429 | 623 |301 27 |18092 
342800 500 
GO:BP | regulation of vesicle-mediated |GO:0060627 | 0.014934894370 | 1.8257978446824 | 552 |301 25 | 18092 
transport 01000 800 
GO:BP | positive regulation of wound GO:0090303 | 0.015378285449 | 1.8130920820571 |60 |301 8 | 18092 
healing 114000 300 
GO:BP | cellular oxidant detoxification | GO:0098869 | 0.015394196067 | 1.8126429863611 |100 |301 10 |18092 
769600 700 
GO:BP | ATP metabolic process GO:0046034 | 0.015568959022 | 1.8077404244158 | 317 |301 18 | 18092 
630300 9 
GO:BP | epithelial cell differentiation GO:0030855 | 0.015619991936 | 1.8063191946533 | 817 |301 32 | 18092 
533400 400 
GO:BP | cellular response to stress GO:0033554 | 0.015741866920 | 1.8029437633881 | 2126 |301 62 | 18092 
49090 900 
GO:BP | oxoacid metabolic process GO:0043436 | 0.016032430029 | 1.7950006468068 | 1060 |301 38 | 18092 
339800 200 
GO:BP | ribose phosphate biosynthetic | GO:0046390 | 0.016194325791 | 1.7906371279908 |200 |301 14 | 18092 
process 654500 900 
GO:BP | positive regulation of GO:0032388 | 0.016201066158 | 1.7904564044878 | 228 |301 15 | 18092 
intracellular transport 933700 600 
GO:BP | movement in host environment | GO:0052126 | 0.018189420914 | 1.7401811270793 |175 |301 1з | 18092 
737400 100 
GO:BP | positive regulation of response to | GO:0048584 | 0.018318738611 | 1.7371044342237 |2374 | 301 67 | 18092 
stimulus 010100 400 
GO:BP | modulation of process of other | GO:0051817 | 0.018397089686 | 1.7352508744284 | 102 — |301 10 |18092 
organism involved in symbiotic 96650 600 
interaction 
GO:BP | heterotypic cell-cell adhesion | GO:0034113 | 0.019744972905 | 1.7045434578798 | 62 |301 8 | 18092 
316400 100 
GO:BP | endodermal cell differentiation | GO:0035987 | 0.020679808420 | 1.6844534888966 |45 |301 7 | 18092 
677000 700 
GO:BP | telomere maintenance via GO:0010833 | 0.020870226398 | 1.6804728397252 |82 |301 9 (18092 
telomere lengthening 161400 100 
GO:BP | daunorubicin metabolic process | GO:0044597 | 0.021096514025 | 1.6757893013158 |9 301 4 |18092 
541700 40 
GO:BP | polyketide metabolic process GO:0030638 | 0.021096514025 | 1.6757893013158 | 9 301 4 | 18092 
541700 40 
GO:BP | doxorubicin metabolic process | GO:0044598 | 0.021096514025 | 1.6757893013158 |9 301 4 | 18092 
541700 40 
GO:BP | activation of immune response | GO:0002253 | 0.022967743573 | 1.6388816692188 | 566 |301 25 | 18092 
78900 900 
GO:BP | ribonucleotide metabolic process | GO:0009259 | 0.022999400808 | 1.6382834782758 | 425 |301 21 | 18092 
967200 400 
GO:BP | formation of primary germ layer | GO:0001704 | 0.023228740653 | 1.6339743348340 |128 |301 11 | 18092 
978500 1 
GO:BP | phosphorylation GO:0016310 | 0.024896577259 | 1.6038603548906 |2157 |301 62 |18092 
556800 900 
GO:BP | modulation by host of symbiont | GO:0051851 | 0.025114650101 | 1.6000728680742 |64 |301 8 (18092 
process 464600 800 
GO:BP | purine ribonucleoside GO:0009205 | 0.028072665222 | 1.5517163534357 |85 — |301 9° [18092 
triphosphate metabolic process 346600 300 
GO:BP | cellular response to endogenous | GO:0071495 | 0.029589098367 | 1.5288682683611 | 1434 |301 46 |18092 


stimulus 


67530 


40 
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GO:BP | neural nucleus development GO:0048857 0.031665506500 | 1.4994135608907 | 66 301 8 18092 
72090 600 
GO:BP | organic acid metabolic pro GO:0006082 0.032761700974 |1.4846335579341 |1094 |301 38 18092 
13050 700 
GO:BP | tissue homeostasis GO:0001894 0.032925803302 | 1.4824636206029 | 272 301 16 18092 
3096 400 
СО:ВР е regulation of gene GO:0010629 0.034039855833 | 1.4680122879192 |2516 |301 69 18092 
expression 487100 200 
GO:BP | positive regulation of GO:1902533 | 0.034097268335 | 1.4672804126280 | 1054 |301 37 |18092 
intracellular signal transduction 683200 100 
GO:BP | positive regulation of GO:1904851 0.034700174013 | 1.4596683473171 | 10 301 4 18092 
establishment of protein 35630 700 
localization to telomere 
GO:BP | aminoglycoside antibiotic GO:0030647 0.034700174013 |1.4596683473171 |10 301 4 18092 
metabolic process 35630 700 
GO:BP | detoxification GO:0098754 0.035927070629 | 1.4445781922421 | 134 301 11 18092 
23230 700 
GO:BP | growth GO:0040007 0.036832032445 | 1.4337743154755 |1016 |301 36 18092 
724400 600 
GO:BP | regulation of endocytosis GO:0030100 0.036881183629 | 1.4331951495568 | 215 301 14 18092 
95760 400 
GO:BP | nucleoside triphosphate GO:0009142 | 0.037296303106 | 1.4283342142999 | 88 301 9 18092 
biosynthetic process 882300 200 
GO:BP | positive regulation of cell GO:0045785 0.037444043461 | 1.4266172593361 | 439 301 21 18092 
adhesion 91790 700 
GO:BP | positive regulation of GO:1902905 0.038856483704 | 1.4105365031917 | 216 301 14 18092 
supramolecular fiber organization 470800 300 
GO:BP | generation of precursor GO:0006091 0.039969880524 | 1.3982671498994 | 548 301 24 18092 
metabolites and energy 94100 200 
GO:BP | cellular response to chemical GO:0062197 0.040313132455 | 1.3945534545014 | 373 301 19 18092 
stress 99260 500 
GO:BP | nucleoside diphosphate GO:0006165 0.041324524491 | 1.3837921350216 | 136 301 11 18092 
phosphorylation 3301 00 
GO:BP | cornification GO:0070268 0.042233908469 | 1.3743387256318 | 112 301 10 18092 
042800 600 
СО:ВР | establishment or maintenance of | GO:0061245 0.042395013733 | 1.3726852197110 | 50 301 7 18092 
bipolar cell polarity 70410 400 
GO:BP | establishment or maintenance of | GO:0035088 | 0.042395013733 | 1.3726852197110 | 50 301 7 18092 
apical/basal cell polarity 70410 400 
GO:BP | actin filament fragmentation GO:0030043 0.042892616202 | 1.3676174634925 |4 301 3 18092 
39600 100 
СО:ВР | protein targeting to vacuole GO:0071211 0.042892616202 | 1.3676174634925 |4 301 3 18092 
involved in autophagy 39600 100 
GO:BP | pentose biosynthetic process GO:0019322 0.042892616202 | 1.3676174634925 | 4 301 3 18092 
39600 100 
GO:BP | telomere maintenance via GO:0007004 0.044152078971 | 1.3550488421490 | 69 301 8 18092 
telomerase 82920 100 
GO:BP | nucleotide phosphorylation GO:0046939 0.044277313889 | 1.3538187337340 | 137 301 11 18092 
7279 00 
GO:BP | regulation of DNA biosynthetic | GO:2000278 0.045662543824 | 1.3404398979626 | 113 301 10 18092 
process 95220 00 
GO:BP | purine ribonucleotide metabolic | GO:0009150 0.046821421780 | 1.3295554026454 | 411 301 20 18092 
pro 099900 500 
GO:BP | response to heat GO:0009408 0.047415130503 | 1.3240830497639 | 164 301 12 18092 
99180 100 
GO:BP | chaperone-mediated protein GO:0051131 0.048003181358 |1.3187299792774 |21 301 5 18092 
complex assembly 48880 900 
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GO:CC | extracellular exosome GO:0070062 9.061888786088 | 168.04278127209 | 2180 | 301 241 |18963 
37E-169 300 

GO:CC | extracellular ve: GO:1903561 3.877792795954 | 165.41141540051 | 2266 | 301 242 |18963 
88E-166 200 

GO:CC | extracellular organelle GO:0043230 4.825636291821 |165.31644541396 |2268 |301 242 |18963 
06Е-166 60 

СО:СС | extracellular space GO:0005615 1.674776814187 | 139.77604306016 | 3598 |301 259 |18963 
02Е-140 500 

СО:СС | extracellular region GO:0005576 8.333898306417 | 119.07915180328 | 4602 |301 264 | 18963 
05Е-120 300 

СО:СС | vesicle GO:0031982 4.862720984549 | 115.31312064886 | 4071 |301 251 |18963 
59Е-116 600 

СО:СС | focal adhesion GO:0005925 2.957487277617 | 49.529077114924 | 418 301 71 18963 
53E-50 

СО:СС | cell-substrate junction GO:0030055 1.168006731872 | 48.932554654135 | 426 301 71 18963 
46Е-49 70 

СО:СС | blood microparticle GO:0072562 5.062282102091 | 40.295653656892 | 141 301 43 18963 
94E-41 500 

GO:CC | anchoring junction GO:0070161 1.981572397464 | 38.702990055934 | 843 301 82 18963 
15E-39 000 

GO:CC | collagen-containing extracellular. | GO:0062023 9.778502935973 | 31.009727629508 | 424 301 55 18963 

matrix 61E-32 200 

СО:СС | membrane-bounded organelle GO:0043227 1.861923356889 | 29.730038200027 | 12836 | 301 287 |18963 
64Е-30 600 

GO:CC | melanosome GO:0042470 4.865536060626 | 29.312869304348 | 105 301 32 18963 
4E-30 

GO:CC | pigment granule GO:0048770 4.865536060626 | 29.312869304348 | 105 301 32 18963 
4E-30 

GO:CC | vesicle lumen GO:0031983 5.846556035771 | 29.233099883507 | 326 301 48 18963 
65E-30 50 

GO:CC | secretory granule lumen GO:0034774 3.823557826800 | 27.417532337233 | 320 301 46 18963 
9E-28 700 

СО:СС | cytoplasmic vesicle lumen GO:0060205 6.705731301117 | 27.173553852753 | 324 301 46 18963 
32E-28 300 

GO:CC | cytoplasm GO:0005737 1.593879055433 | 25.797544636233 | 11888 | 301 273 |18963 
44Е-26 90 

GO:CC | extracellular matrix GO:0031012 3.195563587246 | 25.495452536176 | 564 301 56 18963 
45E-26 600 

GO:CC | secretory granule GO:0030141 9.010046295868 | 25.045272977501 | 855 301 67 18963 
11E-26 700 

GO:CC | organelle GO:0043226 9.525745465113 | 25.021101027335 | 13974 | 301 292 |18963 
4E-26 600 

СО:СС | cell junction GO:0030054 2.485601150198 | 24.604568558785 | 2131 |301 105 | 18963 
17E-25 70 

GO:CC | cytosolic ribosome GO:0022626 9.043660244170 | 23.043655761768 | 109 301 28 18963 
22E-24 300 

СО:СС | cytoplasmic vesicle GO:0031410 1.386442546546 | 22.858098122649 | 2437 | 301 110 | 18963 
75Е-23 20 

СО:СС | intracellular vesicle GO:0097708 1.592691899975 | 22.797868228644 | 2441 | 301 110 |18963 
23Е-23 500 

СО:СС | cytosol GO:0005829 2.829697713830 | 22.548259956075 | 5231 | 301 169 |18963 
71Е-23 400 

СО:СС | protein-containing complex GO:0032991 2.936171609726 | 22.532218564907 | 5556 |301 175 | 18963 
51Е-23 500 

СО:СС | secretory vesicle GO:0099503 6.618203448773 |21.179259886417 | 1028 |301 68 18963 
04Е-22 40 

GO:CC | endoplasmic reticulum lumen GO:0005788 1.514556892720 | 17.819714408213 | 307 301 36 18963 


87E-18 


80 
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СО:СС | endomembrane system GO:0012505 | 6.934240711024 | 17.159001086582 | 4633 |301 147 |18963 
65Е-18 800 

GO:CC | ribosomal subunit GO:0044391 | 8.726372062741 |17.059166274119 | 191 |301 29 |18963 
92E-18 00 

GO:CC | intracellular non-membrane- GO:0043232 | 2.040074972907 | 16.690353871926 |4326 |301 140 |18963 
bounded organelle 85E-17 100 

GO:CC |non-membrane-bounded GO:0043228 | 2.378948360766 | 16.623614984981 |4333 |301 140 | 18963 
organelle 47E-17 50 

GO:CC_| organelle lumen GO:0043233 | 3.832084402111 | 15.416564934100 | 5468 | 301 159 |18963 
51E-16 9 

GO:CC |membrane-enclosed lumen GO:0031974 | 3.832084402111 | 15.416564934100 | 5468 |301 159 |18963 
51Е-16 9 

GO:CC |ribosome GO:0005840 | 9.863595675625 | 15.005964738464 |245 [301 30 |18963 
69E-16 600 

GO:CC | cytosolic large ribosomal subunit | GO:0022625 | 3.849356963320 | 14.414611813506 |57 |301 17 |18963 
14E-15 100 

GO:CC | supramolecular fiber GO:0099512 | 1.624579929441 | 12.789258916475 | 1006 |301 55 |18963 
72E-13 600 

GO:CC | ficolin-1-rich granule lumen GO:1904813 | 1.658259882883 |12.780347405717 | 124 — |301 21 |18963 
39E-13 30 

GO:CC |supramolecular polymer GO:0099081 | 2.282558636807 | 12.641578057029 | 1014 |301 55  |18963 
39E-13 200 

GO:CC | supramolecular complex GO:0099080 | 2.734509147600 |12.563120619465 |1337 |301 64 [18963 
89E-13 800 

GO:CC | intracellular anatomical structure | GO:0005622 | 2.855787156995 | 11.544274163838 | 14764 |301 283 |18963 
96Е-12 000 

СО:СС | ribonucleoprotein complex GO:1990904 | 5.061734658279 | 11.295700624777 | 693 |301 аз | 18963 
2Е-12 400 

GO:CC | endoplasmic reticulum GO:0005783 | 1.270710517715 | 10.895953375392 | 1975 |301 77 |18963 
19Е-11 600 

СО:СС | platelet alpha granule GO:0031091 | 1.608864486265 | 10.793480534736 |90 |301 17 |18963 
75E-11 000 

GO:CC | cell surface GO:0009986 | 2.375464685684 | 10.624251421538 |922 |301 49 |18963 
91E-11 700 

GO:CC | platelet alpha granule lumen GO:0031093 | 2.786775467335 | 10.554898021320 | 66 |301 15  |18963 
54E-11 90 

GO:CC | ficolin-1-rich granule GO:0101002 | 5.744889479074 | 10.240718321908 |184 [301 22 |18963 
88Е-11 600 

GO:CC | polysome GO:0005844 | 1.366834647315 |9.8642840209857 |73 |301 15 |18963 
988-10 8 

СО:СС | large ribosomal subunit GO:0015934 | 1.599479329157 | 9.7960213681407 | 119 |301 18 |18963 
Е-10 10 

СО:СС | intracellular organelle GO:0043229 | 3.075045566226 | 9.5121484444368 | 12431 |301 252 | 18963 
69E-10 00 

СО:СС | cytoskeleton GO:0005856 | 5.680361801063 |9.2456240017535 |2302 |301 81 |18963 
76Е-10 2 

СО:СС | lysosome GO:0005764 | 9.428226947386 | 9.0255699720947 | 709 |301 40 |18963 
76Е-10 30 

GO:CC | lytic vacuole GO:0000323 | 9.428226947386 | 9.0255699720947 | 709 |301 40 |18963 
76Е-10 30 

GO:CC | cytosolic small ribosomal subunit | GO:0022627 | 1.947010127890 | 8.7106317893758 |47 |301 12 |18963 
79Е-09 90 

GO:CC | membrane raft GO:0045121 | 9.099167017352 | 8.0409983633190 |336 |301 26 |18963 
64E-09 6 

СО:СС | membrane microdomain GO:0098857 | 9.720798868574 | 8.0122980426913 | 337 |301 26 |18963 
77E-09 00 

GO:CC | vacuole GO:0005773 | 1.321279102103 | 7.8790054339372 | 807 |301 41 |18963 
ПЕ-08 30 
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СО:СС | vacuolar lumen GO:0005775 1.321619098221 | 7.8788936941519 | 174 301 19 18963 
15Е-08 20 

GO:CC | intracellular organelle lumen GO:0070013 1.424363085887 | 7.8463792901250 | 5304 |301 137 |18963 
23Е-08 10 

СО:СС | cell cortex GO:0005938 1.578658108385 | 7.8017119154120 | 318 301 25 18963 
66E-08 80 

GO:CC | basement membrane GO:0005604 2.503140391111 | 7.6015147918719 | 103 301 15 18963 
43E-08 40 

GO:CC | basolateral plasma membrane GO:0016323 2.513788080330 | 7.5996713373988 | 249 301 22 18963 
22E-08 70 

GO:CC | membrane GO:0016020 3.366394959816 | 7.4728349320897 | 9813 | 301 210 |18963 
5Е-08 70 

СО:СС | small ribosomal subunit GO:0015935 5.824429597503 | 7.2347465998790 | 76 301 13 18963 
16Е-08 300 

СО:СС | actin cytoskeleton GO:0015629 5.882789193159 | 7.2304167140616 | 513 301 31 18963 
69E-08 20 

СО:СС | cortical cytoskeleton GO:0030863 9.624225668559 | 7.0166342022246 | 113 301 15 18963 
98E-08 00 

GO:CC | intracellular membrane-bounded | GO:0043231 1.596498798420 | 6.7968314039891 | 11250 | 301 229 |18963 

organelle 8E-07 80 

СО:СС | synapse GO:0045202 1.687969642201 | 6.7726353683400 | 1382 |301 54 18963 
61Е-07 30 

GO:CC | cell-cell junction GO:0005911 1.849497991843 | 6.7329461358986 | 506 301 30 18963 
43E-07 20 

GO:CC | filopodium GO:0030175 4.593551767715 | 6.3378513849148 | 107 301 14 18963 
22Е-07 60 

СО:СС | side of membrane GO:0098552 6.204926271280 | 6.2072633745465 | 632 301 33 18963 
86E-07 80 

СО:СС | polysomal ribosome GO:0042788 6.256377732604 | 6.2036770383635 | 34 301 9 18963 
45E-07 800 

СО:СС | actin-based cell projection GO:0098858 8.561450498173 | 6.0674526500440 | 222 301 19 18963 
07Е-07 60 

GO:CC | cell periphery GO:0071944 9.348911907115 | 6.0292389324680 | 5786 | 301 140 | 18963 
89E-07 00 

СО:СС | primary lysosome GO:0005766 9.678419599300 | 6.0141955533670 | 154 301 16 18963 
73Е-07 30 

СО:СС | azurophil granule GO:0042582 9.678419599300 | 6.0141955533670 | 154 301 16 18963 
73Е-07 30 

GO:CC | perinuclear region of cytoplasm | GO:0048471 0.000002518964 | 5.5987779606829 | 740 301 35 18963 
4555025500 20 

GO:CC | cell leading edge GO:0031252 0.000007477760 | 5.1262284698598 | 429 301 25 18963 
135245090 20 

GO:CC | polymeric cytoskeletal fiber GO:0099513 0.000008038282 | 5.0948367098875 | 775 301 35 18963 
962766630 60 

GO:CC | lamellipodium GO:0030027 0.000008810290 | 5.0550097769266 | 204 301 17 18963 
388779000 70 

СО:СС | endocytic vesicle GO:0030139 0.000009458758 | 5.0241658534029 | 313 301 21 18963 
703590170 20 

СО:СС | fibrinogen complex GO:0005577 0.000015624875 | 4.8061834327127 |8 301 5 18963 
562723000 20 

СО:СС | COP9 signalosome GO:0008180 0.000017726018 | 4.7513888136772 | 35 301 8 18963 
01416640 4 

СО:СС | cortical actin cytoskeleton GO:0030864 0.000023388734 | 4.6309932680250 | 83 301 11 18963 
931279800 80 

СО:СС | plasma membrane region GO:0098590 0.000026481801 | 4.5770524664378 | 1255 |301 46 18963 
967689900 50 

СО:СС | endoplasmic reticulum chaperone | GO:0034663 0.000068496259 | 4.1643331427896 |10 301 5 18963 


complex 


71768730 


00 
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GO:CC | plasma membrane GO:0005886 | 0.000088221876 |4.0544237085262 | 5663 |301 131 |18963 
57875230 20 

GO:CC | sarcolemma GO:0042383 | 0.000093472859 |4.0293144702342 | 138 |301 13 |18963 
72318110 30 

GO:CC | endocytic vesicle lumen GO:0071682 | 0.000145563161 | 3.8369485195962 |20 |301 6 |18963 
80314600 100 

GO:CC | platelet dense granule GO:0042827 | 0.000201079360 | 3.6966325044743 |21 |301 6 |18963 
57866900 80 

GO:CC | banded collagen fibril GO:0098643 | 0.000209718607 | 3.6783630342776 |12 |301 5 | 18963 
65504900 800 

GO:CC | fibrillar collagen trimer GO:0005583 | 0.000209718607 | 3.6783630342776 |12 |301 5 | 18963 
65504900 800 

GO:CC | cell trailing edge GO:0031254 | 0.000336369000 | 3.4731840352495 |13 |301 5 |18963 
51163100 300 

GO:CC | uropod GO:0001931 | 0.000336369000 | 3.4731840352495 |13 |301 5 | 18963 
51163100 300 

GO:CC | brush border GO:0005903 | 0.000385802708 | 3.4136347279243 | 109 |301 11 |18963 
34376300 2 

GO:CC | rough endoplasmic reticulum | GO:0005791 | 0.000437669880 |3.3588533392688 |89 |301 10 |18963 
80435400 700 

GO:CC | chaperone complex GO:0101031 | 0.000479137856 | 3.3195395139268 |24 |301 6 |18963 
96042400 900 

GO:CC | nucleosome GO:0000786 | 0.000505884420 |3.2959486954128 |112 |301 11 |18963 
2074210 100 

GO:CC | platelet dense granule lumen GO:0031089 | 0.000516450932 | 3.2869709337452 |14 |301 5 | 18963 
89807 70 

GO:CC | azurophil granule lumen GO:0035578 | 0.000537929165 | 3.2692749087107 |91 |301 10 |18963 
1843650 90 

GO:CC | cell body GO:0044297 | 0.000571334876 |3.2431092643951 |614 |301 27 |18963 
1225490 300 

GO:CC | microvillus GO:0005902 | 0.000595161303 | 3.2253653140209 |92 |301 10 |18963 
0924640 000 

GO:CC |DNA packaging complex GO:0044815 | 0.000921231437 | 3.0356312497180 | 119 |301 1 |18963 
9512540 3 

GO:CC | cell-cell contact zone GO:0044291 | 0.000947980188 | 3.0232007385643 |76 |301 9 |18963 
8673940 30 

GO:CC | lysosomal membrane GO:0005765 | 0.000971726850 | 3.0124557967127 | 381 |301 20 |18963 
6814700 000 

GO:CC | lytic vacuole membrane GO:0098852 | 0.000971726850 | 3.0124557967127 | 381 |301 20 |18963 
6814700 000 

GO:CC | invadopodium GO:0071437 | 0.001097763086 | 2.9594913769818 |16 |301 5 |18963 
3992200 6 

GO:CC | laminin-10 complex GO:0043259 | 0.001178602168 | 2.9286327642646 |3 301 3 |18963 
2850600 5 

GO:CC | external side of plasma GO:0009897 | 0.001179173233 |2.9284223877047 |420 |301 21 |18963 
membrane 1832700 80 

GO:CC | zona pellucida receptor complex | GO:0002199 | 0.001232965326 | 2.9090491365946 |8 301 4 |18963 
1724500 30 

GO:CC | high-density lipoprotein particle | GO:0034364 | 0.001581699295 | 2.8008760788559 |29 |301 6 |18963 
3487100 90 

GO:CC | ruffle GO:0001726 | 0.001876834244 | 2.7265740809990 | 180 |301 13 |18963 
8043500 7 

GO:CC | protein-DNA complex GO:0032993 | 0.002333406681 | 2.6320095628532 |212 |301 14 |18963 
772340 7 

GO:CC |chaperonin-containing T- GO:0005832 | 0.003606966466 | 2.4428578952964 |10 |301 4 |18963 
complex 051200 50 

GO:CC | cell projection membrane GO:0031253 | 0.003699645764 |2.4318398569960 | зав |301 18 |18963 
3953400 500 
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GO:CC | cluster of actin-based cell GO:0098862 | 0.003976024952 | 2.4005508987446 | 165 |301 12 |18963 
projections 519730 400 

GO:CC | basal plasma membrane GO:0009925 | 0.004485048633 | 2.3482328433208 | 51 301 7 18963 
638730 000 

GO:CC | very-low-density lipoprotein GO:0034361 | 0.004791328881 | 2.3195440177432 | 21 301 5 18963 
particle 252390 000 

GO:CC | complex of collagen trimers GO:0098644 | 0.004791328881 | 2.3195440177432 | 21 301 5 18963 
252390 000 

GO:CC | triglyceride-rich plasma GO:0034385 | 0.004791328881 | 2.3195440177432 | 21 301 5 18963 
lipoprotein particle 252390 000 

GO:CC | intercalated disc GO:0014704 | 0.005811734907 | 2.2356942035162 | 53 301 7 18963 
732240 60 

GO:CC | collagen trimer GO:0005581 | 0.006009765056 | 2.2211425057847 | 95 301 9 18963 
8621600 40 

GO:CC | plasma membrane protein GO:0098797 | 0.006044920440 | 2.2186094106770 | 699 |301 27 |18963 
complex 5499800 700 

GO:CC | lamellipodium membrane GO:0031258 | 0.006120305145 | 2.2132269242956 | 22 301 5 18963 
676950 20 

GO:CC | ruffle membrane GO:0032587 | 0.006540381951 | 2.1843968885896 |96 |301 9 18963 
4946900 800 

GO:CC | lysosomal lumen GO:0043202 | 0.007110277949 | 2.1481134218113 |97 301 9 18963 
8640900 300 

GO:CC | vacuolar membrane GO:0005774 | 0.007490172760 | 2.1255081651779 |438 |301 20 |18963 
91822 400 

GO:CC | plasma lipoprotein particle GO:0034358 | 0.008152538621 | 2.0887071351273 | 38 301 6 18963 
143190 90 

GO:CC | lipoprotein particle GO:1990777 | 0.008152538621 | 2.0887071351273 | 38 301 6 18963 
143190 90 

GO:CC | leading edge membrane GO:0031256 | 0.008483638691 | 2.0714178358545 | 178 |301 12 |18963 
772990 1 

GO:CC | cytoplasmic ribonucleoprotein | GO:0036464 | 0.008566489408 |2.0671971177198 |238 |301 14 |18963 
granule 807540 700 

GO:CC | intermediate-density lipoprotein | GO:0034363 | 0.011509896187 | 1.9389285934358 |5 301 3 18963 
particle 438800 70 

GO:CC | midbody GO:0030496 | 0.011755087719 | 1.9297741257122 | 184 |301 12 |18963 
423400 100 

GO:CC | protein-lipid complex GO:0032994 | 0.012758196726 | 1.8942107054693 | 41 301 6 18963 
801500 300 

GO:CC |actomyosin GO:0042641 | 0.012875502853 | 1.8902358005607 | 81 301 8 18963 
468700 800 

GO:CC | ribonucleoprotein granule GO:0035770 | 0.014047314347 | 1.8524066989168 | 249 [301 14 |18963 
916400 700 

GO:CC | chylomicron GO:0042627 | 0.016350442847 | 1.7864704800831 |14 |301 4 |18963 
800200 200 

GO:CC | microvillus membrane GO:0031528 | 0.017580773153 | 1.7549620297706 | 27 301 5 18963 
836900 800 

GO:CC | cell projection GO:0042995 | 0.019905887126 | 1.7010184628609 | 2352 |301 61 |18963 
672000 300 

GO:CC | membrane protein complex GO:0098796 | 0.020182207018 | 1.6950313433683 | 1326 |301 40 |18963 
775800 900 

GO:CC | nucleus GO:0005634 | 0.020797309378 | 1.6819928476165 |7565 | 301 153 |18963 
51130 700 

GO:CC | plasma membrane bounded cell | GO:0120025 | 0.021088749653 | 1.6759491687065 | 2255 |301 59 [18963 
projection 39870 400 

GO:CC | apical part of cell GO:0045177 | 0.027479787614 | 1.5609866281744 | 443 |301 19 |18963 
233800 500 

GO:CC | glutamatergic synapse GO:0098978 | 0.031083704017 | 1.5074672350870 |373 |301 17 |18963 
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СО:СС | basal рагі of cell GO:0045178 0.032762226916 | 1.4846265860134 | 69 301 7 18963 
378500 200 
GO:CC | adherens junction GO:0005912 0.034193225141 | 1.4660599341484 | 175 301 11 18963 
802600 100 
GO:CC | contractile actin filament bundle | GO:0097517 0.039308173408 | 1.4055171367213 | 71 301 7 18963 
39750 700 
GO:CC | stress fiber GO:0001725 0.039308173408 | 1.4055171367213 | 71 301 7 18963 
39750 700 
СО:СС | membrane attack complex GO:0005579 0.039342257486 | 1.4051407235792 | 7 301 3 18963 
36760 800 
GO:CC | contractile fiber GO:0043292 0.041218847329 | 1.3849041572794 | 242 301 13 18963 
429100 600 
GO:CC | microtubule cytoskeleton GO:0015630 0.045342873913 | 1.3434909569677 | 1283 |301 38 18963 
3215 30 
GO:CC | filopodium membrane GO:0031527 0.047537190080 | 1.3229664929586 | 18 301 4 18963 
77930 600 
KEGG | Coronavirus disease - COVID-19 | KEGG:05171 1.317376100476 | 14.880290219690 | 231 237 37 7987 
46E-15 10 
KEGG | Ribosome KEGG:03010 | 4.607017727499 | 11.336580116607 | 154 237 27 7987 
16E-12 700 
KEGG | Focal adhesion KEGG:04510 | 1.234021792792 | 10.908677170606 | 200 237 30 7987 
3E-11 000 
KEGG | ECM-receptor interaction KEGG:04512 | 6.853878733915 | 9.1640635838363 | 88 237 19 7987 
4E-10 10 
KEGG | Complement and coagulation KEGG:04610 | 3.551108694129 | 8.4496360343900 | 85 237 18 7987 
cascades 17E-09 00 
KEGG | Phagosome KEGG:04145 | 0.000006647513 | 5.1773407548900 | 148 237 19 7987 
762126080 8 
KEGG | Prion disease KEGG:05020 | 0.000012452610 | 4.9047396005396 | 273 237 26 7987 
364241100 80 
KEGG | Pathogenic Escherichia coli KEGG:05130 | 0.000031079191 | 4.5075302819959 | 196 237 21 7987 
infection 89699530 1 
KEGG  |Legionellosis KEGG:05134 | 0.000071133920 | 4.1479232522839 | 57 237 11 7987 
90934830 7 
КЕСС | Regulation of actin cytoskeleton | KEGG:04810 | 0.000168891072 | 3.7723933069756 |217 237 21 7987 
28294900 30 
КЕСС  |PI3K-Akt signaling pathway KEGG:04151 | 0.000174238473 | 3.7588559432122 | 353 237 28 7987 
18112600 500 
КЕСС | Systemic lupus erythematosus KEGG:05322 | 0.000182834016 | 3.7379430006186 | 133 237 16 7987 
259361 800 
КЕСС | Carbon metabolism KEGG:01200 | 0.000192267657 | 3.7160937651477 |118 237 15 7987 
39324700 900 
КЕСС | Proteoglycans іп сапсег KEGG:05205 | 0.000266367905 | 3.5745181046908 | 205 237 20 7987 
20887900 600 
KEGG | Salmonella infection KEGG:05132 | 0.000432636961 | 3.3638763804150 | 249 237 22 7987 
29060900 700 
КЕСС | Parkinson disease KEGG:05012 | 0.001484059319 | 2.8285487394436 | 249 237 21 7987 
6142000 200 
КЕСС | Platelet activation KEGG:04611 | 0.001723293612 | 2.7636407216641 | 124 237 14 7987 
740790 900 
КЕСС | Glycolysis / Gluconeogenesis KEGG:00010 | 0.002538304522 | 2.5954562764162 | 67 237 10 7987 
675390 600 
KEGG | Central carbon metabolism in KEGG:05230 | 0.003755708824 | 2.4253080854463 | 70 237 10 7987 
сапсег 2674600 300 
KEGG | Amoebiasis KEGG:05146 | 0.004190930706 | 2.3776895197746 | 101 237 12 7987 
738070 200 
KEGG | Protein digestion and absorption | KEGG:04974 | 0.005101480990 | 2.2923037273349 | 103 237 12 7987 
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KEGG | Gap junction KEGG:04540 | 0.005483845566 | 2.2609147841307 | 88 237 ll 7987 
2239 300 
KEGG | Biosynthesis of amino acids KEGG:01230 | 0.006891624708 | 2.1616783806038 | 75 237 10 7987 
36129 00 
KEGG | HIF-1 signaling pathway KEGG:04066 | 0.008933495535 | 2.0489785752759 | 109 237 12 7987 
899970 300 
KEGG | Tight junction KEGG:04530 | 0.013664862468 | 1.8643947349640 | 168 237 15 7987 
169600 500 
КЕСС | Leukocyte transendothelial KEGG:04670 | 0.013813343875 | 1.8597011765293 | 114 237 12 7987 
migration 65930 800 
KEGG | Amyotrophic lateral sclerosis KEGG:05014 | 0.018930164546 | 1.7228456110035 | 363 237 24 7987 
598100 000 
KEGG | Estrogen signaling pathway KEGG:04915 | 0.020990232265 | 1.6779827557442 | 137 237 13 7987 
536700 90 
КЕСС | Pentose phosphate pathway KEGG:00030 | 0.021604702371 | 1.6654517121903 | 30 237 6 7987 


727900 


600 


Table S 7. GO analysis of WI-38 cell-derived exosome. 


The analysis уус 


1s performed by using g:Profiler software. 


*The list of proteins associated with each ontology is available on request deanna.shea@vuw.ac.nz 
Source | Term name Term ID Adjusted Negative 10210 Term | Query | Inter | Effective 
p-value of adjusted size size secti | domain 
p-value on | size 
size 
СО:МЕ | structural molecule activity GO:0005198 2.412352006127 | 41.617559320427 | 698 307 80 18694 
33E-42 20 
GO:MF | extracellular matrix structural GO:0005201 5.207081651559 | 41.283405612156 | 169 307 47 18694 
constituent 05E-42 60 
GO:MF | cell adhesion molecule binding GO:0050839 1.650866912378 |33.782287936761 | 548 307 65 18694 
07E-34 20 
СО:МЕ | integrin binding GO:0005178 7.140091192197 |22.146296241443 | 148 307 31 18694 
45E-23 000 
СО:МЕ | signaling receptor binding GO:0005102 4.993562548682 | 18.301589505840 | 1681 |307 85 18694 
65E-19 2 
GO:MF | protein-containing complex GO:0044877 4.267379641734 | 17.369838718993 | 1282 |307 72 18694 
binding 46E-18 000 
СО:МЕ | cadherin binding GO:0045296 2.572955902416 | 11.589567657047 | 332 307 31 18694 
32E-12 100 
GO:MF | collagen binding GO:0005518 8.034958622142 | 11.095016355393 | 69 307 16 18694 
69E-12 700 
GO:MF | glycosaminoglycan binding GO:0005539 9.987825402491 | 11.000529058170 | 237 307 26 18694 
43E-12 600 
СО:МЕ | peptidase regulator activity GO:0061134 2.404839431816 |10.618913915632 | 225 307 25 18694 
91Е-П 700 
СО:МЕ | identical protein binding GO:0042802 2.707320471408 | 10.567460332804 | 2080 |307 81 18694 
26E-11 90 
СО:МЕ | endopeptidase inhibitor activity | GO:0004866 1.046071122823 | 9.0804387865969 | 179 307 22 18694 
9E-10 70 
GO:MF | peptidase inhibitor activity GO:0030414 1.850535470635 |9.7327025860329 | 184 307 22 18694 
77Е-10 40 
СО:МЕ | endopeptidase regulator activity | GO:0061135 2.882199598735 | 9.5402759466213 | 188 307 22 18694 
53E-10 3 
GO:MF | protease binding GO:0002020 6.906767771084 | 9.1607251461991 | 139 307 19 18694 
67E-10 50 
GO:MF | extracellular matrix structural GO:0030020 7.781708302130 | 9.1089250527856 | 41 307 12 18694 
constituent conferring tensile 49E-10 1 
strength 
GO:MF | extracellular matrix binding GO:0050840 1.862494465400 |8.7299050091297 | 55 307 13 18694 
74Е-09 30 
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GO:MF | calcium ion binding GO:0005509 | 1.942531072236 | 8.7116320256066 | 722 |307 4l |18694 
6E-09 00 
СО:МЕ | heparin binding GO:0008201 | 2.423870911406 | 8.6154905132007 |168 |307 20 |18694 
36Е-09 70 
GO:MF | proteoglycan binding GO:0043394 | 5.563997265902 | 8.2546130920882 |37 |307 11 [18694 
67E-09 70 
GO:MF | sulfur compound binding GO:1901681 | 6.127828276297 | 8.2126934136922 | 264 |307 24 |18694 
29E-09 70 
GO:MF | enzyme inhibitor activity GO:0004857 | 2.552930348800 | 7.5929610338226 | 386 |307 28 | 18694 
16E-08 10 
GO:MF | growth factor binding GO:0019838 | 5.121814720333 | 7.2905761360183 | 137 |307 17 | 18694 
09Е-08 800 
GO:MF | lipoprotein particle receptor GO:0070325 |9.175352771122 | 7.0373772291351 |26 |307 9 [18694 
binding 54E-08 00 
СО:МЕ | carbohydrate derivative binding | GO:0097367 | 1.797100443940 | 6.7454276485282 | 2286 |307 77 |18694 
44E-07 10 
GO:MF | enzyme binding GO:0019899 | 2.547870113993 | 6.5938227153530 | 1866 |307 67 | 18694 
48E-07 50 
СО:МЕ | protein binding GO:0005515 | 7.384154820285 | 6.1316992062277 | 14752 | 307 281 | 18694 
39E-07 10 
GO:MF | antioxidant activity GO:0016209 | 8.206025433856 | 6.0858671413093 | 87 |307 1з | 18694 
66E-07 40 
GO:MF | RNA binding GO:0003723 | 0.000003591225 | 5.4447573636442 | 1946 |307 66 |18694 
170743800 70 
СО:МЕ | serine-type endopeptidase GO:0004867 | 0.000003666739 | 5.4357198859972 | 98 |307 1 | 18694 
inhibitor activity 981099860 2 
СО:МЕ | platelet-derived growth factor | GO:0048407 | 0.000003910390 | 5.4077798782236 | 11 307 6 (1864 
binding 433680990 
GO:MF | structural constituent of GO:0005200 | 0.000007643345 | 5.1167165282549 | 104 |307 1з | 18694 
cytoskeleton 15548826 600 
GO:MF | enzyme regulator activity GO:0030234 | 0.000062807503 | 4.2019884680049 | 1228 | 307 46 | 18694 
61194520 8 
GO:MF | peroxidase activity GO:0004601 | 0.000088185976 | 4.0546004704770 |53 | 307 9 | 18694 
74907810 10 
СО:МЕ | oxidoreductase activity, acting on | GO:0016684 | 0.000169011587 | 3.7720835195794 |57 |307 9 | 18694 
peroxide as acceptor 26184100 00 
GO:MF | low-density lipoprotein particle | GO:0050750 | 0.000399814695 | 3.3981412470147 | 21 307 6 (1864 
receptor binding 5721960 500 
СО:МЕ | oxidoreductase activity, acting on | GO:0016616 | 0.000507591368 | 3.2944857717807 | 125 |307 12 | 18694 
the CH-OH group of donors, 3053100 300 
NAD or NADP as acceptor 
СО:МЕ | extracellular matrix structural | GO:0030021 | 0.000542184163 | 3.2658531719705 |22 |307 6 (1864 
constituent conferring 2269750 50 
compression resistance 
СО:МЕ | disordered domain specific GO:0097718 | 0.000661938276 | 3.1791825054214 |35 |307 7 | 18694 
binding 1019850 50 
GO:MF | unfolded protein binding GO:0051082 | 0.001047552736 | 2.9798241044401 | 111 |307 11 [18694 
1853700 40 
СО:МЕ | oxidoreductase activity, acting on | GO:0016614 | 0.001344381412 | 2.8714775005920 |137 |307 12 | 18694 
CH-OH group of donors 9681800 00 
GO:MF | oxidoreductase activity GO:0016491 | 0.002075069434 | 2.6829673666240 | 771 |307 31 | 18694 
6489600 000 
GO:MF se-bisphosphate aldolase | GO:0004332 | 0.002129390951 | 2.6717445957440 | 3 307 3 | 18694 
4240100 00 
СО:МЕ | actin binding GO:0003779 | 0.002337243398 | 2.6312960581666 | 445 |307 22 |18694 
6905500 50 
GO:MF | laminin binding GO:0043236 | 0.002519529095 | 2.5986806220637 | 28 |307 6 [18694 
4222200 80 
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СО:МЕ | heparan sulfate proteoglycan GO:0043395 0.002953662036 | 2.5296391989624 | 17 307 5 18694 
binding 5049400 000 
СО:МЕ | haptoglobin binding GO:0031720 0.006725776412 | 2.1722575741725 | 10 307 4 18694 
8521200 20 
СО:МЕ | small molecule binding GO:0036094 0.007935423020 | 2.1004299169687 | 2507 | 307 68 18694 
672960 200 
СО:МЕ | endopeptidase activity GO:0004175 0.010125395297 | 1.9945880128349 | 453 307 21 18694 
548900 700 
СО:МЕ | ion binding GO:0043 167 0.010160992932 | 1.9930638507108 | 6055 |307 134 | 18694 
480400 800 
СО:МЕ | nucleoside binding GO:0001882 0.015245897377 | 1.8168470078623 | 395 307 19 18694 
406000 400 
СО:МЕ | molecular carrier activity GO:0140104 0.016998441197 | 1.7695909027675 | 56 307 7 18694 
23440 500 
СО:МЕ | lipid binding GO:0008289 0.017830884676 | 1.7488271088337 | 779 307 29 18694 
69860 90 
СО:МЕ | cytoskeletal protein binding GO:0008092 0.021321313957 | 1.6711860347788 | 995 307 34 18694 
612000 300 
СО:МЕ | actin filament binding GO:0051015 0.022478870520 | 1.6482255142249 | 210 307 13 18694 
99580 600 
СО:МЕ | protein folding chaperone GO:0044183 0.027810851163 | 1.5557857191572 | 26 307 5 18694 
434700 000 
СО:МЕ | carbohydrate binding GO:0030246 0.029116869680 | 1.5358553173186 | 278 307 15 18694 
908000 000 
СО:МЕ | GTP binding GO:0005525 0.032080134210 | 1.4937638234727 | 382 307 18 18694 
225500 10 
СО:МЕ | serine-type peptidase activity GO:0008236 0.032723013046 | 1.4851467145032 | 188 307 12 18694 
16100 900 
СО:МЕ | purine ribonucleoside binding GO:0032550 0.036637249887 | 1.4360771333531 | 386 307 18 18694 
5066 400 
GO:ME | purine nucleoside binding GO:0001883 0.040421483285 | 1.3933877538812 | 389 307 18 18694 
96840 900 
СО:МЕ | ribonucleoside binding GO:0032549 0.040421483285 | 1.3933877538812 | 389 307 18 18694 
96840 900 
СО:МЕ | oxygen carrier activity GO:0005344 0.040966121014 | 1.3875751564184 | 15 307 4 18694 
79300 60 
СО:МЕ | serine hydrolase activity GO:0017171 0.042092426322 | 1.3757960396058 | 193 307 12 18694 
854600 300 
GO:BP | regulated exocytosis GO:0045055 4.240491780238 | 35.372583774289 | 799 308 80 18092 
8Е-36 60 
СО:ВР | exocytosis GO:0006887 3.316460460850 | 33.479325176033 | 921 308 83 18092 
37E-34 90 
GO:BP | platelet degranulation GO:0002576 3.771013705213 | 31.423541889209 | 127 308 37 18092 
35E-32 900 
СО:ВР | vesicle-mediated transport GO:0016192 8.035578411145 | 31.094982856706 | 2193 | 308 122 |18092 
92Е-32 800 
GO:BP | extracellular matrix organization | GO:0030198 1.060096340200 |30.974654664814 | 399 308 56 18092 
88E-31 10 
GO:BP | extracellular structure GO:0043062 1.215455797190 | 30.915260830638 | 400 308 56 18092 
organization 74Е-31 200 
СО:ВР | secretion Бу cell GO:0032940 1.679046494880 | 24.774937277542 | 1396 | 308 89 18092 
24E-25 50 
GO:BP | export from cell GO:0140352 2.71541613508E | 23.566163605736 | 1450 | 308 89 18092 
-24 200 
GO:BP | secretion GO:0046903 8.680603294288 | 23.061450090690 | 1538 |308 91 18092 
94E-24 40 
GO:BP | wound healing GO:0042060 2.594845695434 | 22.585888462714 | 553 308 55 18092 


ПЕ-23 


900 
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GO:BP | response to wounding GO:0009611 6.874812382467 | 22.162739149478 | 682 308 60 18092 
14E-23 900 

GO:BP | hemost: GO:0007599 4.464994832753 | 21.350179039375 | 352 308 44 18092 
67E-22 20 

GO:BP | organonitrogen compound GO:1901564 1.576564746102 | 20.802288189015 | 6780 |308 204 |18092 
metabolic process 12E-21 40 

GO:BP | blood coagulation GO:0007596 2.350606385210 | 20.628820088460 | 347 308 43 18092 
64E-21 200 

GO:BP | coagulation GO:0050817 5.937852932833 | 20.226370563097 | 355 308 43 18092 
32E-21 30 

GO:BP | biological adhesion GO:0022610 2.823183426151 | 18.549260904220 | 1501 |308 83 18092 
76Е-19 20 

GO:BP | response to organic substance GO:0010033 1.205491746211 | 17.918835758813 | 3459 |308 132 |18092 
56Е-18 700 

GO:BP | response to stress GO:0006950 1.608036967338 | 17.793703971429 | 4199 |308 148 | 18092 
08Е-18 400 

GO:BP | cell activation GO:0001775 1.762146550490 | 17.753957975933 |1510 |308 82 18092 
65Е-18 800 

GO:BP | transport GO:0006810 3.351080538961 | 17.474815134323 | 5259 | 308 169 | 18092 
38E-18 400 

GO:BP | cell adhesion GO:0007155 3.747956294414 | 17.426205482140 | 1494 |308 81 18092 
31E-18 000 

GO:BP | response to chemical GO:0042221 6.032366873099 | 17.219512253657 | 4834 | 308 160 | 18092 
89Е-18 200 

GO:BP | cellular response to chemical СО:0070887 7.261942537116 | 17.138947191929 | 3438 |308 130 | 18092 
stimulus 7E-18 900 

GO:BP  |establishment of localization GO:0051234 2.223693471608 | 16.652925078944 | 5398 |308 170 [18092 
13E-17 300 

GO:BP | establishment of localization in | GO:0051649 2.559235674327 | 16.591889719057 | 2840 |308 115 |18092 
сей 83E-17 200 

GO:BP | tissue development GO:0009888 2.667054628610 | 16.573968088674 | 2161 |308 98 18092 
26Е-17 2 

GO:BP | regulation of body fluid levels GO:0050878 6.917467005903 | 16.160052903627 | 520 308 46 18092 
29Е-17 300 

GO:BP | localization GO:0051179 1.062389762176 | 15.973716123077 | 6935 | 308 197 | 18092 
98Е-16 200 

GO:BP | regulation of biological quality GO:0065008 3.804660128133 | 15.419684132866 | 4194 |308 143 |18092 
66Е-16 800 

GO:BP | protein metabolic process GO:0019538 4.552907648658 | 15.341711158847 | 5806 | 308 175 |18092 
74Е-16 200 

GO:BP | neutrophil mediated immunity GO:0002446 5.231146096048 | 15.281403150777 | 498 308 44 18092 
94E-16 000 

GO:BP | immune effector process GO:0002252 5.537604435907 | 15.256678070157 | 1310 | 308 72 18092 
41E-16 500 

GO:BP | leukocyte mediated immunity GO:0002443 1.499275258770 | 14.824118625730 | 891 308 58 18092 
86Е-15 200 

GO:BP GO:0009653 1.735706811468 | 14.760523632221 | 2865 |308 112 |18092 
86Е-15 000 

GO:BP | biological process involved in GO:0044419 3.161874279990 | 14.500055402143 | 2272 |308 97 18092 
interaction between 93E-15 100 

organisms 

GO:BP  |neutrophil degranulation GO:0043312 7.234765076610 |14.140575566492 |484 308 42 18092 
98E-15 10 

GO:BP  |neutrophil activation involved in. | GO:0002283 9.076333683018 | 14.042089546058 | 487 308 42 18092 
immune response 85E-15 400 

СО:ВР | immune system proc: GO:0002376 9.748763196671 | 14.011050478756 | 3322 |308 121 |18092 


9Е-15 


000 
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GO:BP | neutrophil activation GO:0042119 | 2.377067715247 | 13.623958446400 | 500 |308 42 |18092 
44E-14 000 
GO:BP | myeloid leukocyte mediated GO:0002444 | 3.498183014611 | 13.456157473257 | 556 |308 44 |18092 
immunity 19E-14 400 
GO:BP | granulocyte activation GO:0036230 | 3.940553608988 | 13.404442759788 | 507 |308 42 |18092 
118-14 900 
СО:ВР | anatomical structure development | GO:0048856 | 4.665460196248 | 13.331105511438 | 6210 |308 178 |18092 
92Е-14 600 
СО:ВР | cell-substrate adhesion GO:0031589 | 4.945151131423 | 13.305820431156 |368 |308 36 |18092 
33E-14 700 
GO:BP | immune response GO:0006955 | 5.099196608349 | 13.292498242734 |2326 |308 9% |18092 
69E-14 40 
GO:BP | leukocyte degranulation GO:0043299 | 6.304976834826 | 13.200316504733 |539 |308 аз |18092 
75Е-14 30 
GO:BP | cellular localization GO:0051641 | 1.175034493147 | 12.929949384487 | 3562 |308 124 |18092 
34E-13 000 
СО:ВР |post-translational protein GO:0043687 | 2.055649831018 |12.687050863124 |362 |308 35 |18092 
modification 09Е-13 5 
GO:BP | cellular response to organic GO:0071310 | 3.465225712260 | 12.460268471760 |2812 |308 106 | 18092 
substance 74E-13 200 
GO:BP | myeloid cell activation involved | GO:0002275 | 6.862625138047 | 12.163509723231 | 549 |308 4 |18092 
in immune response 55Е-13 400 
GO:BP | response to stimulus GO:0050896 | 1.001847713811 | 11.999198288526 | 9469 |308 230 |18092 
28E-12 400 
GO:BP | cell activation involved in GO:0002263 | 1.129924798488 | 11.946950459780 | 723 |308 48 |18092 
immune response 98E-12 400 
GO:BP | leukocyte activation involved іп | GO:0002366 | 4.290101376684 | 11.367532445154 | 719 | 308 47 |18092 
immune response 38E-12 600 
GO:BP | endocytosis GO:0006897 | 7.705233777717 | 11.113214180199 | 701 |308 46 |18092 
77Е-12 000 
GO:BP |myeloid leukocyte activation GO:0002274 | 8.056377675400 | 11.093860182660 |673 |308 45 |18092 
83E-12 80 
GO:BP | platelet activation GO:0030168 | 1.013535133644 | 10.994161192127 | 160 |308 23 |18092 
54Е-11 900 
GO:BP | developmental process GO:0032502 | 1.072103970077 | 10.969763095761 | 6745 |308 182 |18092 
09Е-11 50 
GO:BP | negative regulation of proteolysis | GO:0045861 | 1.276792380544 | 10.893879717704 |366 |308 зз |18092 
48Е-11 900 
GO:BP | proteolysis GO:0006508 | 1.343230074497 | 10.871849593044 | 1846 |308 79 |18092 
73E-11 700 
GO:BP |regulation of proteolysis GO:0030162 | 2.572321850879 | 10.589674693089 | 754 |308 47 |18092 
65Е-11 900 
GO:BP | multicellular organism GO:0007275 | 2.880394340915 | 10.540548051004 | 5718 |308 162 |18092 
development 1Е-11 800 
GO:BP | cellular component organization | GO:0071840 | 3.772657040340 | 10.423352673266 | 6767 |308 181 |18092 
or biogenesis 79E-11 40 
GO:BP | cellular protein metabolic process | GO:0044267 | 4.738963889634 | 10.324316600562 | 5225 | 308 152 |18092 
828-11 500 
GO:BP | negative regulation of peptidase | GO:0010466 | 5.507099526323 | 10.259077074718 | 271 | 308 28 |18092 
activity 01E-11 600 
GO:BP | cellular component organization | GO:0016043 | 6.221765114407 | 10.206086388508 | 6576 |308 177 |18092 
54Е-11 900 
GO:BP | plasminogen activation GO:0031639 | 8.212499674968 | 10.085524634620 | 23 308 11 |18092 
09Е-11 500 
GO:BP | response to external stimulus GO:0009605 | 1.872801662372 |9.7275082138143 |2991 |308 104 |18092 
81E-10 10 
GO:BP | platelet aggregation GO:0070527 | 3.91773459981E | 9.4069649878772 |64 |308 15 |18092 


-10 


80 


268 


СО:ВР | regulation of developmental GO:0050793 | 5.941879338939 | 9.2260761716109 | 2648 |308 95 |18092 
process 29E-10 30 

GO:BP | multicellular organismal pr GO:0032501 | 8.362075169876 | 9.0776859327296 | 8065 |308 200 |18092 
25Е-10 70 

GO:BP | negative regulation of GO:0010951 | 8.729417704462 | 9.0590147249282 |258 |308 26 |18092 
48E-10 30 

GO:BP | protein activation cascade GO:0072376 | 1.209126906879 | 8.9175281143046 | 28 308 11 |18092 
468-09 50 

СО:ВР | blood coagulation, fibrin clot GO:0072378 | 1.209126906879 | 8.9175281143046 | 28 308 11 |18092 
formation 46E-09 50 

СО:ВР | zymogen activation GO:0031638 | 1.274865472857 | 8.8945356407031 | 57 308 14 |18092 
27E-09 20 

GO:BP | system development GO:0048731 | 2.539320282195 | 8.5952825184970 | 5151 |308 146 |18092 
16Е-09 70 

GO:BP | animal organ development GO:0048513 | 2.870821770919 | 8.5419937689485 | 3770 | 308 118 |18092 
9E-09 20 

GO:BP | regulation of anatomical structure | GO:0022603 | 3.285778890038 | 8.4833616649018 | 1062 | 308 5з | 18092 
morphogenesis 94Е-09 30 

GO:BP | cell morphogenesis GO:0000902 | 3.407806112642 | 8.4675251224161 | 1063 |308 5з |18092 
228-09 8 

СО:ВР | biological process involved in | GO:0044403 |3.573872865062 | 8.4468609009025 |996 |308 51 |18092 
symbiotic interaction 78E-09 00 

СО:ВР | negative regulation of hemostasis | GO:1900047 | 3.953634302597 | 8.4030035038915 |50 |308 13 |18092 
27E-09 9 

GO:BP | homotypic cell-cell adhesion GO:0034109 | 4.049665034314 | 8.3925808977150 | 88 308 16 |18092 
27E-09 00 

GO:BP | cell surface receptor signaling | GO:0007166 | 5.370591261850 | 8.2699778991004 | 3242 |308 106 |18092 
pathway 73E-09 90 

GO:BP | cell differentiation GO:0030154 | 5.670344406757 | 8.2463905620221 | 4440 |308 131 |18092 
34E-09 30 

GO:BP | cellular developmental pro GO:0048869 | 9.152132645521 | 8.0384776941016 |4518 |308 132 |18092 
6E-09 80 

GO:BP | anatomical structure formation | GO:0048646 | 9.836316725938 | 8.0071674955786 | 1235 | 308 57 | 18092 
involved in morphogenesis 83E-09 10 

GO:BP | positive regulation of biological | GO:0048518 | 9.995409007002 | 8.0001994300751 | 6466 | 308 169 |18092 
pro 78E-09 20 

GO:BP | fibrinolysis GO:0042730 | 1.152509845189 | 7.9383553560092 | 25 308 10 |18092 
66E-08 20 

СО:ВР | response to cytokine GO:0034097 | 1.234779793439 | 7.9084104861461 | 1242 |308 57 |18092 
5E-08 50 

GO:BP | catabolic process GO:0009056 | 1.337760056784 | 7.8736217754764 | 2746 |308 94 |18092 
56E-08 00 

GO:BP | regulation of peptidase activity | GO:0052547 | 1.346633722451 | 7.8707505141134 |469 |308 33 |18092 
32E-08 70 

GO:BP | supramolecular fiber organization | GO:0097435 | 1.627958615422 | 7.7883566395709 | 739 |308 42 | 18092 
45E-08 70 

GO:BP | regulation of hemostasis GO:1900046 | 1.725880188293 | 7.7629893565750 | 68 308 14 |18092 
8E-08 00 

GO:BP | negative regulation of GO:0050819 | 1.921519100908 | 7.7163552940629 |56 |308 13 |18092 
coagulation 87E-08 70 

GO:BP | collagen fibril organization GO:0030199 | 1.921519100908 | 7.7163552940629 |56 |308 13 |18092 
87E-08 70 

GO:BP | cellular response to cytokine GO:0071345 | 1.949749264382 | 7.7100212348407 | 1148 | 308 54 | 18092 
stimulus 93E-08 500 

GO:BP | cellular detoxification GO:1990748 | 1.978067811422 | 7.7037588241535 | 113 |308 17 |18092 
18E-08 700 

GO:BP | cell migration GO:0016477 | 2.591685195342 | 7.5864177521180 | 1684 |308 68 |18092 
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GO:BP | leukocyte activation GO:0045321 | 3.195987176705 | 7.4953949718791 | 1346 |308 59 |18092 
03E-08 40 

GO:BP | cell-matrix adhesion GO:0007160 | 4.202970641700 | 7.3764436435877 |236 |308 23 |18092 
38E-08 20 

GO:BP | humoral immune response GO:0006959 | 5.152138807576 | 7.2880124448422 |384 |308 29 |18092 
94E-08 2 

GO:BP | receptor-mediated endocytosis | GO:0006898 | 5.196595121491 | 7.2842811187268 |333 |308 27 |18092 
41E-08 50 

GO:BP | cellular response to toxic GO:0097237 | 5.334878128279 | 7.2728754972930 | 120 |308 17 |18092 
substance 33E-08 00 

GO:BP | сей morphogenesis involved іп | GO:0000904 | 5.794484145692 | 7.2369852210724 |769 |308 42 | 18092 
differentiation 53E-08 60 

GO:BP | movement of cell or subcellular | GO:0006928 | 6.059890991510 | 7.2175351880798 |2337 | 308 83 | 18092 
component 49E-08 00 

GO:BP | negative regulation of blood GO:0030195 | 6.288945647458 | 7.2014221586680 |49 |308 12 |18092 
coagulation 24E-08 10 

GO:BP | viral process GO:0016032 | 6.959593882166 | 7.1574161023262 |940 |308 47 |18092 
18E-08 50 

GO:BP | regulation of coagulation GO:0050818 | 7.055626324420 | 7.1514644280944 | 75 308 14 |18092 
45Е-08 3 

GO:BP | regulation of cellular component | GO:0051128 | 7.185951260449 |7.1435157327742 |2431 |308 85 |18092 
organization 8E-08 20 

GO:BP | blood vessel development GO:0001568 | 8.138747979359 | 7.0894423994691 | 810 |308 аз |18092 
24E-08 30 

GO:BP | cell motility GO:0048870 | 9.818887774385 | 7.0079377037409 | 1856 |308 71 |18092 
36E-08 100 

GO:BP | localization of cell GO:0051674 | 9.818887774385 | 7.0079377037409 | 1856 |308 71 |18092 
36E-08 100 

СО:ВР | hydrogen peroxide catabolic GO:0042744 | 9.824227725616 | 7.0077015791393 |30 |308 10 |18092 
process 65E-08 50 

GO:BP | negative regulation of catalytic | GO:0043086 | 1.285406226288 | 6.8909596007720 |855 |308 44 |18092 
activity 64E-07 90 

GO:BP | positive regulation of response to | GO:0048584 | 1.356581707622 | 6.8675540433318 |2374 | 308 83 [18092 
stimulus 5Е-07 60 

GO:BP regulation of biological | GO:0048519 | 1.386122316881 | 6.8581984441810 | 5833 |308 154 |18092 
45Е-07 900 

GO:BP | nitrogen compound metabolic |GO:0006807 | 1.546447204775 | 6.8106649020856 | 10057 | 308 226 |18092 
process 71E-07 1 

GO:BP | regulation of blood coagulation | GO:0030193 | 2.181354009977 | 6.6612738474412 | 67 308 1 |18092 
61E-07 00 

GO:BP | locomotion GO:0040011 | 2.187900964755 | 6.6599723402149 | 2055 | 308 75 | 18092 
7E-07 90 

GO:BP | response to oxygen-containing | GO:1901700 | 2.620021757064 | 6.5816951022170 |1774 | 308 68 |18092 
compound 23E-07 10 

GO:BP | detoxification GO:0098754 | 3.199230554410 | 6.4949544611037 | 134 |308 17 |18092 
08Е-07 4 

GO:BP | vasculature development GO:0001944 | 3.357807598660 | 6.4739441924249 | 847 |308 аз |18092 
34E-07 000 

СО:ВР | defense response GO:0006952 | 3.889226566679 | 6.4101367563144 | 1831 | 308 69 |18092 
71E-07 40 

GO:BP | negative regulation of hydrolase | GO:0051346 | 4.087276424006 | 6.3885659897998 | 475 | 308 31 |18092 
activity 06E-07 10 

СО:ВР | positive regulation of cellular | GO:0048522 | 4.516301958968 | 6.3452170289415 | 5863 |308 153 |18092 
ptocess 47E-07 60 

GO:BP | peptide cross-linking GO:0018149 | 5.567084936296 | 6.2543721526676 | 35 308 10 |18092 
02E-07 80 

GO:BP | organic substance catabolic GO:1901575 | 5.998975129184 | 6.2219229385760 |2271 | 308 79 |18092 
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GO:BP | tube morphogenesis GO:0035239 |7.913983113753 | 6.1016048807330 |975 |308 46 |18092 
77E-07 40 
GO:BP | gluco: bolic process to GO:0061718 | 9.570253282440 | 6.0190765682095 |27 |308 9 (18092 
pyruvate 6E-07 00 
GO:BP | NADH regeneration GO:0006735 | 9.570253282440 | 6.0190765682095 |27 |308 9 (18092 
6E-07 00 
GO:BP | canonical glycolysis GO:0061621 | 9.570253282440 | 6.0190765682095 |27 |308 9 [18092 
6E-07 00 
СО:ВР | negative regulation of response to | GO:1903035 | 0.000001052853 | 5.9776320931429 | 91 308 14 (18092 
wounding 4070578400 
GO:BP | regulation of endopeptidase GO:0052548 | 0.000001117154 | 5.9518869237808 |437 |308 29 |18092 
activity 0806624800 7 
GO:BP | regulation of response to GO:1903034 | 0.000001377792 | 5.8608160705157 |167 |308 18 |18092 
wounding 8589330900 90 
GO:BP | glycolytic process through GO:0061620 | 0.000001389405 | 5.8571711328394 |28 |308 9 (18092 
glucose-6-phosphate 0311605000 60 
GO:BP | regulation of complement GO:0030449 | 0.000001428646 | 5.8450753457933 | 110 |308 15 |18092 
activation 0810573800 20 
GO:BP | blood vessel morphogenesis GO:0048514 | 0.000001693865 | 5.7711209895428 | 723 |308 38 |18092 
8427961400 40 
GO:BP | metabolic process GO:0008152 | 0.000001957805 | 5.7082304898608 | 11998 |308 252 |18092 
3463505900 700 
GO:BP | glycolytic process through GO:0061615 | 0.000001984717 | 5.7023014081075 |29 |308 9 (18092 
fructose-6-phosphate 010303260 1 
GO:BP | regulation of humoral immune | GO:0002920 | 0.000002004438 | 5.6980071896321 | 131 |308 16 |18092 
response 8441270800 20 
GO:BP | regulation of cell motility GO:2000145 | 0.000002011081 | 5.6965704040322 | 1076 |308 48 |18092 
1528435200 9 
GO:BP | NAD metabolic process GO:0019674 | 0.000002364574 | 5.6262470645876 | 52 |308 11 | 18092 
1387389700 70 
GO:BP ulation of protein | GO:0051248 | 0.000002423629 | 5.6155337768179 | 1156 |308 50 18092 
metabolic process 4618727900 40 
GO:BP | biological regulation GO:0065007 | 0.000002511812 | 5.6000127502548 | 12801 | 308 263 |18092 
6872568800 50 
GO:BP | regulation of cell migration GO:0030334 | 0.000002980627 | 5.5256923517775 |1016 |308 46 |18092 
116899660 20 
GO:BP | negative regulation of cellular | GO:0048523 | 0.000003163825 | 5.4997874512042 | 5208 |308 138 |18092 
process 694600310 20 
GO:BP | regulation of multicellular GO:0051239 | 0.000003299344 | 5.4815723846690 |2894 |308 91 (18092 
organismal process 125456800 50 
СО:ВР | regulation of protein metabolic | GO:0051246 | 0.000004008874 | 5.3969775088327 |2722 |308 87 | 18092 
process 781808800 40 
GO:BP | complement activation GO:0006956 | 0.000004671883 | 5.3305079738119 | 180 |308 18 | 18092 
733368570 20 
GO:BP | response to other organism GO:0051707 | 0.000004777472 | 5.3208017768181 | 1610 | 308 61 |18092 
800596470 10 
СО:ВР | response to external biotic GO:0043207 | 0.000005011441 | 5.3000373472864 | 1612 | 308 61 |18092 
stimulus 357313400 70 
GO:BP | regulation of cell adhesion GO:0030155 | 0.000005063740 | 5.2955285200510 | 752 |308 38 |18092 
94923932 50 
GO:BP ubstance metabolic GO:0071704 | 0.000005637368 | 5.2489235762060 | 11403 | 308 242 |18092 
491616890 10 
GO:BP | regulation of catalytic activity | GO:0050790 | 0.000005874255 | 5.2310471771634 |2428 |308 80 | 18092 
3736053200 40 
GO:BP | regulation of locomotion GO:0040012 | 0.000007030088 | 5.1530392127346 | 1118 |308 ав | 18092 
418871180 10 
GO:BP | regulation of cellular component | GO:0051270 | 0.000007135246 | 5.1465910256988 | 1156 |308 49 |18092 
movement 38667975 90 
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GO:BP | regulation of localization GO:0032879 | 0.000008712599 | 5.0598522723845 | 2903 | 308 90 |18092 
033224100 20 

GO:BP | cellular response to stimulus GO:0051716 | 0.000009841847 | 5.0069233541403 | 7840 | 308 184 |18092 
82922226 80 

СО:ВР | NADH metabolic process GO:0006734 | 0.000010474558 | 4.9798642587862 |46 |308 10 |18092 
85195100 80 

GO:BP | regulation of cell morphogenesis | GO:0022604 | 0.000010637918 | 4.9731433348771 |312 |308 23 |18092 
656135100 40 

СО:ВР | response to biotic stimulus GO:0009607 | 0.000010873278 | 4.9636394986121 | 1645 |308 61 |18092 
237626100 80 

GO:BP | negative regulation of fibrinolysis | GO:0051918 | 0.000011497224 | 4.9394069781325 |10 |308 6 18092 
769855400 90 

СО:ВР | regulation of response to stimulus | GO:0048583 | 0.000011539694 | 4.9378056885587 | 4324 | 308 119 |18092 
497634300 90 

GO:BP | negative regulation of cellular | GO:0032269 | 0.000011930174 | 4.9233532090906 | 1099 | 308 47 |18092 
protein metabolic process 358916100 20 

СО:ВР | substrate adhesion-dependent cell | GO:0034446 | 0.000012006121 | 4.0205972819928 |109 [308 14 |18092 
spreading 056999200 30 

GO:BP | tube development GO:0035295 | 0.000013195553 | 4.8795723986810 | 1178 |308 4 |18092 
18057840 00 

GO:BP | negative regulation of wound GO:0061045 | 0.000014030232 | 4.8529351337284 |76 |308 12 |18092 
healing 446191100 50 

СО:ВР | response to endogenous stimulus | GO:0009719 | 0.000014239081 | 4.8465180127754 | 1698 |308 62 |18092 
874793300 10 

СО:ВР | defense response to other GO:0098542 | 0.000015437404 | 4.8114257041155 | 1222 |308 50 |18092 
organism 924300900 20 

GO:BP | cellular component assembly GO:0022607 | 0.000015539012 | 4.8085765971047 | 3030 |308 92 |18092 
029463100 5 

GO:BP | innate immune response GO:0045087 | 0.000016503385 | 4.7824269659555 |999 |308 44 |18092 
106302500 70 

GO:BP | glucose catabolic process GO:0006007 | 0.000016802314 | 4.7746308851684 |36 |308 9 18092 
709174900 60 

GO:BP | cellular catabolic process GO:0044248 | 0.000016803711 | 4.7745947729784 | 2350 |308 77 |18092 
903093000 60 

GO:BP | cellular component biogenesis | GO:0044085 | 0.000018295260 | 4.7376614018692 | 3276 |308 9 |18092 
52074870 00 

GO:BP | cell-cell adhesion GO:0098609 | 0.000023009320 | 4.6380962140315 | 901 |308 4 |18092 
101728200 00 

GO:BP | regulation of wound healing GO:0061041 | 0.000023016735 | 4.6379562703799 | 134 |308 15 |18092 
639481700 00 

GO:BP | cytoskeleton organization GO:0007010 | 0.000023706761 | 4.6251277724569 | 1435 | 308 55 |18092 
336458400 10 

GO:BP | response to oxygen levels GO:0070482 | 0.000023833367 | 4.6228145929366 |410 |308 26 |18092 
35690140 00 

GO:BP | regulation of cell-substrate GO:0010810 | 0.000023915889 | 4.6213134576151 |223 |308 19 |18092 
adhesion 732861700 60 

GO:BP | protein folding GO:0006457 | 0.000025734918 | 4.5894771993633 |224 |308 19 |18092 
702478100 30 

GO:BP | cell junction organization GO:0034330 | 0.000026350499 | 4.5792111507027 | 730 |308 36 |18092 
329238900 30 

GO:BP | cell development GO:0048468 | 0.000030683182 | 4.5130995932633 |2161 |308 72 |18092 
74934020 700 

GO:BP | regulation of cellular protein GO:0032268 | 0.000030983816 | 4.5088650930940 | 2563 |308 81 |18092 
metabolic proce: 143731900 00 

GO:BP | primary metabolic process GO:0044238 | 0.000033599701 | 4.4736645761888 | 10575 |308 227 |18092 
862043900 40 

GO:BP | cellular oxidant detoxification | GO:0098869 | 0.000036980971 | 4.4320216812008 | 100 | 308 1 |18092 
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70 
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СО:ВР | monosaccharide biosynthetic GO:0046364 | 0.000036980971 | 4.4320216812008 | 100 |308 13 |18092 
рго 737919200 70 
GO:BP | circulatory system development | GO:0072359 | 0.000039560727 | 4.4027357286284 | 1219 |308 49 |18092 
67640310 50 
СО:ВР | cytokine-mediated signaling GO:0019221 | 0.000040549601 | 4.3920134050565 | 812 |308 38 |18092 
pathway 91002180 10 
GO:BP | carbohydrate derivative GO:1901135 | 0.000046434654 | 4.3331577785468 | 1148 |308 47 |18092 
metabolic proi 815712000 50 
GO:BP |positive regulation of signal GO:0009967 | 0.000046560905 |4.3319785781438 | 1627 |308 5 |18092 
transduction 94244600 700 
GO:BP | regulation of plasminogen GO:0010755 | 0.000049151712 | 4.3084613463059 | 12 308 6 18092 
activation 495752900 20 
GO:BP | сей death GO:0008219 | 0.000052317616 | 4.2813520512717 |2322 |308 75 |18092 
33927300 700 
GO:BP | regulation of biological process | GO:0050789 | 0.000055000888 | 4.2596302977732 | 12145 |308 250 |18092 
11490720 20 
СО:ВР | ossification GO:0001503 | 0.000057252012 | 4.2422092405025 | 428 |308 26 |18092 
71289640 50 
GO:BP | regulation of vesicle-mediated | GO:0060627 | 0.000057430319 | 4.2408587651297 | 552 |308 30 |18092 
transport 82897170 500 
GO:BP | response to hypoxia GO:0001666 | 0.000061883034 | 4.2084283995290 |371 |308 24 |18092 
33364690 6 
СО:ВР | positive regulation of cell GO:0010647 | 0.000061974560 | 4.2077865447833 | 1808 |308 63 |18092 
communication 44536020 2 
GO:BP | macromolecule catabolic pro GO:0009057 | 0.000063983401 | 4.1939326751554 |1477 |308 55 |18092 
5621227 70 
GO:BP | regulation of hydrolase activity | GO:0051336 | 0.000066515590 | 4.1770765461113 |1318 |308 sı |18092 
9523129 10 
GO:BP | positive regulation of signaling | GO:0023056 | 0.000068725618 | 4.1628813459652 | 1813 |308 63 |18092 
03229660 60 
GO:BP | hydrogen peroxide metabolic GO:0042743 | 0.000078715603 | 4.1039391703791 |56 |308 10 |18092 
process 52819850 
GO:BP | angiogenesis GO:0001525 | 0.000083446836 | 4.0785901248737 | 627 |308 32 | 18092 
127456 20 
GO:BP | negative regulation of apoptotic | GO:0043066 | 0.000084830355 | 4.0714487106664 | 944 |308 41 |18092 
pro 95373110 20 
GO:BP | cellular metabolic process GO:0044237 | 0.000086588151 | 4.0625415313223 | 10855 |308 230 |18092 
56675730 90 
GO:BP | programmed cell death GO:0012501 | 0.000087182811 | 4.0595691312427 |2172 |308 71 |18092 
21460950 60 
GO:BP | ameboidal-type cell migration | GO:0001667 | 0.000088481443 | 4.0531477998992 |499 |308 28 |18092 
69045540 00 
GO:BP | hexose catabolic process GO:0019320 | 0.000094038811 | 4.0266928679384 | 57 308 10 |18092 
57812 80 
GO:BP | regulation of apoptotic process | GO:0042981 | 0.000100166993 | 3.9992753620818 | 1579 |308 57 |18092 
32563200 500 
GO:BP | activation of immune response | GO:0002253 | 0.000100322816 | 3.9986002830337 | 566 |308 30 [18092 
67646200 70 
СО:ВР | response to growth factor GO:0070848 | 0.000106008234 | 3.9746603991021 | 771 |308 36 |18092 
31138500 800 
GO:BP | regulation of cell death GO:0010941 | 0.000117168267 | 3.9311899928753 | 1754 |308 61 |18092 
18125100 000 
GO:BP GO:0036293 | 0.000124950922 | 3.9032605335852 |385 |308 24 |18092 
38155100 400 
GO:BP | negative regulation of molecular | GO:0044092 | 0.000127803913 | 3.8934558466015 | 1226 |308 48 |18092 
function 73116300 400 
GO:BP | response to toxic substance GO:0009636 | 0.000142000805 | 3.8477091935172 |249 |308 19 |18092 


02782400 


900 
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GO:BP | regulation of fibrinolysis GO:0051917 | 0.000155210641 | 3.8090785047338 |14 |308 6 18092 
9897630 800 

GO:BP | negative regulation of GO:0043069 | 0.000164202038 | 3.7846214559578 |967 |308 4 |18092 
programmed cell death 3638810 000 

GO:BP | establishment or maintenance of | GO:0007163 | 0.000178417291 | 3.7485630571128 |227 |308 18 |18092 
cell polarity 78739000 000 

GO:BP | epithelium development GO:0060429 | 0.000185491943 | 3.7316749489613 | 1361 |308 51 |18092 
26796200 50 

GO:BP | regulation of programmed сей | GO:0043067 | 0.000212475720 | 3.6726906885707 | 1614 |308 57 |18092 
death 90734400 40 

GO:BP | regulation of signaling GO:0023051 | 0.000222756709 | 3.6521692058367 | 3636 |308 101 |18092 
58980900 900 

GO:BP | mucopolysaccharide metabolic | GO:1903510 | 0.000224334782 | 3.6491033855144 | 116 |308 1 |18092 
process 24148900 80 

GO:BP | glycosaminoglycan metabolic | GO:0030203 | 0.000233288727 | 3.6321062452804 | 159 [308 15 |18092 
process 77241200 40 

GO:BP | response to nutrient GO:0007584 | 0.000251559452 | 3.5993593591488 | 183 |308 16 |18092 
5014710 00 

GO:BP | regulation of signal transduction | GO:0009966 | 0.000253021636 | 3.5968423400169 | 3207 | 308 92 |18092 
2868990 000 

GO:BP | negative regulation of GO:0051241 | 0.000253512001 | 3.5960014757810 | 1175 |308 46 |18092 
multicellular organismal process 5862920 500 

GO:BP GO:0046365 | 0.000254315152 | 3.5946277636656 | 63 308 10 |18092 
16852500 60 

GO:BP | regulation of molecular function | GO:0065009 | 0.000255198539 | 3.5931218156669 | 2969 |308 87 |18092 
3484040 60 

GO:BP | regulation of immune system GO:0002682 | 0.000267521994 | 3.5726405067103 | 1625 |308 57 |18092 
ргосевв 2861290 000 

GO:BP | regulation of cell communication | GO:0010646 | 0.000268176469 | 3.5715793315830 | 3600 | 308 100 | 18092 
1020990 100 

СО:ВР | protein localization to membrane | GO:0072657 | 0.000271282274 | 3.5665785820511 | 660 |308 32 |18092 
454716 500 

GO:BP | negative regulation of cell death | GO:0060548 | 0.000307376994 | 3.5123286408148 | 1066 | 308 аз |18092 
0684430 800 

GO:BP | regulation of multicellular GO:2000026 | 0.000358134020 | 3.4459544222864 | 1472 |308 53 |18092 
organismal development 0652830 500 

GO:BP | positive regulation of cell GO:2000147 | 0.000359273062 | 3.4445753443102 | 600 |308 30 |18092 
motility 25560400 900 

GO:BP | reactive oxygen species GO:0072593 | 0.000417575923 | 3.3792645500756 |294 |308 20 |18092 
metabolic process 17614700 900 

GO:BP | cellular response to growth factor | GO:0071363 | 0.000417763145 | 3.3790698760462 |743 |308 34 |18092 
stimulus 01111900 30 

GO:BP | glucose metabolic process GO:0006006 | 0.000461249458 | 3.3360641304048 |216 |308 17 |18092 
80058700 200 

GO:BP | regulation of cell differentiation | GO:0045595 | 0.000469878358 | 3.3280145573696 | 1695 | 308 58 |18092 
21488100 900 

СО:ВР | positive regulation of transport | GO:0051050 | 0.000472521728 | 3.3255782163444 |930 | 308 39 [18092 
1711660 00 

СО:ВР | tissue migration GO:0090130 | 0.000513736065 | 3.2892599447774 |385 |308 23 |18092 
4018730 700 

СО:ВР | mRNA catabolic process GO:0006402 | 0.000537973346 | 3.2692392408821 | 386 |308 23 |18092 
1583860 00 

GO:BP | positive regulation of cell GO:0030335 | 0.000545985728 | 3.2628187094646 | 578 |308 29 |18092 
migration 1091140 20 

GO:BP | positive regulation of cellular | GO:0051272 | 0.000569241834 | 3.2447031906457 |613 |308 30 |18092 
component movement 1953790 60 

GO:BP | positive regulation of locomotion | GO:0040017 | 0.000610226827 | 3.2145087033511 |615 [308 30 |18092 


4504880 


00 
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GO:BP | negative regulation of response to | GO:0048585 | 0.000620635679 | 3.2071632612788 |1752 |308 59 | 18092 
stimulus 1527630 30 
GO:BP | organic cyclic compound GO:1901361 | 0.000623084357 | 3.2054531516694 | 650 |308 31 | 18092 
catabolic process 3265060 200 
GO:BP | phagocytosis GO:0006909 | 0.000645856110 | 3.1898642273076 | 390 |308 23 | 18092 
2947350 80 
GO:BP | cell projection organization GO:0030030 | 0.000661080706 | 3.1797455176584 |1627 |308 56 [18092 
1770030 800 
GO:BP | apoptotic proc GO:0006915 | 0.000668839392 | 3.1746781563131 |2018 |308 65 [18092 
4189630 0 
GO:BP | plasma membrane bounded cell | GO:0120036 | 0.000679040113 | 3.1681045694600 | 1586 |308 55 (18092 
projection organization 5991510 20 
GO:BP | tissue morphogenesis GO:0048729 | 0.000690131380 | 3.1610682246957 | 688 |308 32 | 18092 
3996310 900 
GO:BP | response to inorganic substance | GO:0010035 | 0.000701547555 | 3.1539428840386 | 585 |308 29 |18092 
9318280 90 
GO:BP | aminoglycan metabolic process | GO:0006022 | 0.000707030751 | 3.1505616964710 |173 |308 15 | 18092 
7860530 700 
GO:BP | negative regulation of cell- GO:0010812 | 0.000710587501 | 3.1483824358687 |70 |308 10 |18092 
substrate adhesion 0124340 40 
GO:BP | positive regulation of cellular | GO:0051130 | 0.000784077827 |3.1056408271465 |1182 |308 45 |18092 
component organization 4913620 70 
GO:BP | positive regulation of immune | GO:0002684 | 0.000912825503 | 3.0396122347452 |1070 |308 42 |18092 
system process 2668870 400 
GO:BP | negative regulation of GO:0010605 | 0.000924729142 | 3.0339854554440 |3297 |308 92 [18092 
macromolecule metabolic 6706020 000 
pro 
GO:BP | inflammatory response GO:0006954 | 0.001000467049 | 2.9997972104449 | 808 |308 35 | 18092 
240110 10 
GO:BP | regulation of immune response | GO:0050776 | 0.001050515844 | 2.9785973930143 | 998 |308 40 |18092 
2788000 70 
GO:BP | protein processing GO:0016485 | 0.001127312292 | 2.9479557573858 | 230 |308 17 | 18092 
3165200 900 
GO:BP | nucleobase-containing compound | GO:0034655 | 0.001185650419 | 2.9260433408461 | 566 |308 28 | 18092 
catabolic process 1098800 700 
GO:BP | hexose metabolic process GO:0019318 | 0.001210151573 | 2.9171602301072 | 258 |308 18 | 18092 
6736100 10 
GO:BP | positive regulation of response to | GO:1903036 | 0.001211814985 | 2.9165636812698 |74 |308 10 [18092 
wounding 445960 80 
GO:BP | heterocycle catabolic process GO:0046700 | 0.001224194941 | 2.9121494192927 | 601 |308 29 |18092 
7538700 400 
GO:BP | integrin-mediated signaling GO:0007229 | 0.001275141478 | 2.8944416270805 |113 |308 12 | 18092 
pathway 425060 5 
GO:BP | membrane organization GO:0061024 | 0.001306359500 | 2.8839372919099 | 968 |308 39 | 18092 
6949300 000 
GO:BP | signal transduction GO:0007165 | 0.001346331783 | 2.8708479015750 | 6353 |308 150 | 18092 
2321700 000 
GO:BP | epithelial cell migration GO:0010631 | 0.001353933273 | 2.8684027387640 | 376 |308 22 | 18092 
1535600 900 
GO:BP | protein targeting to ER GO:0045047 | 0.001403730605 | 2.8527162309290 | 114 |308 12 | 18092 
9690800 900 
GO:BP | cellular nitrogen compound GO:0044270 | 0.001450304042 | 2.8385409423615 | 606 |308 29 |18092 
catabolic process 9534800 30 
GO:BP | small molecule metabolic process | GO:0044281 | 0.001461230130 | 2.8352813812762 | 1929 |308 62 | 18092 
361330 000 
GO:BP | response to topologically GO:0035966 | 0.001543598316 | 2.8114657038887 | 209 |308 16 |18092 
incorrect protein 0299400 40 
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GO:BP | collagen metabolic process GO:0032963 | 0.001543709283 | 2.8114344840588 | 115 |308 12 |18092 
6131800 100 
GO:BP | epithelium migration GO:0090132 | 0.001547885996 | 2.8102610286439 |379 |308 22 |18092 
8417500 300 
СО:ВР | response to organonitrogen GO:0010243 | 0.001640240759 | 2.7850924001333 | 1094 |308 42 |18092 
compound 8110600 000 
СО:ВР | negative regulation of metabolic | GO:0009892 | 0.001717950627 | 2.7649893215055 | 3539 |308 96 |18092 
proc 8526200 200 
GO:BP | macromolecule metabolic GO:0043170 | 0.001746125166 | 2.7579246283177 | 9885 |308 210 | 18092 
process 1453400 900 
GO:BP | SRP-dependent cotranslational | GO:0006614 | 0.001775632870 | 2.7506468238468 | 96 |308 11 |18092 
protein targeting to membrane 7855000 200 
GO:BP | response to nitrogen compound | GO:1901698 | 0.001813930229 | 2.7413794215638 | 1178 | 308 44 | 18092 
3792800 6 
GO:BP | positive regulation of wound GO:0090303 | 0.001843902788 | 2.7342619788790 |60 |308 9 18092 
healing 6167200 300 
GO:BP | glycosaminoglycan catabolic GO:0006027 | 0.001843902788 | 2.7342619788790 |60 |308 9 18092 
process 6167200 300 
СО:ВР | response to unfolded protein GO:0006986 | 0.001924972953 | 2.7155753680275 | 187 |308 15 |18092 
7864600 900 
GO:BP | endodermal cell differentiation | GO:0035987 | 0.002008065773 | 2.6972220661068 | 45 308 8 18092 
599890 70 
GO:BP | establishment of protein GO:0072599 | 0.002040933098 | 2.6901712311850 | 118 |308 12 |18092 
localization to endoplasmic 394610 800 
reticulum 
GO:BP | protein-containing complex GO:0065003 | 0.002201896017 | 2.6572031940232 | 1689 |308 56 |18092 
assembly 5580300 30 
СО:ВР | anatomical structure homeostasis | GO:0060249 | 0.002208732779 | 2.6558568235144 |483 |308 25 |18092 
28182 00 
GO:BP | cellular response to endogenous | GO:0071495 | 0.002230088548 | 2.6516778923968 | 1434 |308 50 |18092 
stimulus 4911400 60 
GO:BP | regulation of endocytosis GO:0030100 | 0.002252675376 | 2.6473013881783 |215 |308 16 |18092 
114990 70 
GO:BP | signaling GO:0023052 | 0.002636365220 | 2.5789944263341 | 6864 | 308 158 |18092 
297690 100 
GO:BP | protein maturation GO:0051604 | 0.002796814131 |2.5533363947050 |302 |308 19 |18092 
3040300 700 
GO:BP | endoderm development GO:0007492 | 0.002852519790 | 2.5447713338690 | 81 308 10 |18092 
0937000 200 
GO:BP | RNA catabolic process GO:0006401 | 0.002881043212 | 2.5404502277930 | 425 |308 23 |18092 
0688400 600 
GO:BP | cotranslational protein targeting | СО:0006613 | 0.002967720373 | 2.5275770218416 | 101 |308 11 |18092 
to membrane 5265000 200 
GO:BP | response to abiotic stimulus GO:0009628 | 0.002996458831 | 2.5233916846635 | 1282 |308 46 |18092 
8924600 6 
GO:BP | negative regulation of GO:0010757 | 0.003007355073 | 2.5218152924738 | 6 308 4 18092 
plasminogen activation 212300 200 
GO:BP | skeletal system development GO:0001501 | 0.003035338390 | 2.5177928851080 |525 |308 26 |18092 
9354 800 
GO:BP | сей communication GO:0007154 | 0.003161606005 | 2.5000922521831 | 6882 |308 158 |18092 
224460 300 
GO:BP | nuclear-trans GO:0000184 | 0.003189891220 | 2.4962241266381 | 123 |308 12 |18092 
catabolic pro 992520 800 
mediated decay 
GO:BP | homeostatic process GO:0042592 | 0.004052683674 | 2.3922572931240 | 2033 | 308 63 |18092 
001510 700 
GO:BP | positive regulation of GO:1902533 | 0.004157374787 |2.3811808220411 | 1054 |308 40 |18092 
intracellular signal transduction 721800 900 
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GO:BP | monosaccharide metabolic GO:0005996 | 0.004163504570 | 2.3805409542935 |281 |308 18 |18092 
pro 153030 100 
GO:BP |aminoglycan catabolic proc GO:0006026 | 0.004221896674 | 2.3744923997294 |66 — |308 9 (18092 
1948400 800 
GO:BP | actin filament-based process GO:0030029 | 0.004418499119 | 2.3547252272195 | 824 |308 34 (18092 
452950 10 
GO:BP | positive regulation of blood GO:0030194 | 0.004584470819 | 2.3387107872413 |23 |308 6 (18092 
coagulation 51217 400 
GO:BP | positive regulation of hemost GO:1900048 | 0.004584470819 | 2.3387107872413 |23 |308 6 (18092 
51217 400 
GO:BP | morphogenesis ofa polarized | GO:0001738 | 0.004796799433 | 2.3190484401645 |151 |308 13 |18092 
epithelium 027550 80 
GO:BP | formation of primary germ layer | GO:0001704 | 0.004878245867 | 2.3117363147009 |128 |308 12 |18092 
274270 300 
GO:BP | positive regulation of immune | GO:0050778 | 0.004897308853 | 2.3100425059610 |753 |308 32 | 18092 
response 018050 00 
GO:BP | regulation of G2/M transition of | GO:0010389 | 0.005090269109 | 2.2932592569804 | 202 |308 15 | 18092 
mitotic cell cycle 946160 90 
GO:BP | retinoid metabolic process GO:0001523 | 0.005285100720 | 2.2769467317156 | 107 |308 11 | 18092 
745520 500 
GO:BP | carboxylic acid metabolic process | GO:0019752 | 0.005597950977 | 2.2519709090277 | 1027 | 308 39 | 18092 
998560 600 
GO:BP | actin cytoskeleton organization | GO:0030036 | 0.005956701846 | 2.2249941372958 | 723 |308 31 | 18092 
501090 00 
GO:BP | positive regulation of coagulation | GO:0050820 | 0.006025529154 | 2.2200048078444 | 24 |308 6 (18092 
928420 800 
GO:BP | triglyceride-rich lipoprotein GO:0034370 | 0.006108358417 | 2.2140754879596 |14 |308 5 | 18092 
particle remodeling 488380 60 
GO:BP | humoral immune response GO:0002455 | 0.006431521709 | 2.1916862600893 | 155 |308 1з | 18092 
mediated by circulating 203480 900 
immunoglobulin 
GO:BP | enzyme linked receptor protein | GO:0007167 | 0.006706443434 | 2.1735077340965 |1115 |308 41 | 18092 
signaling pathway 8284200 60 
GO:BP | viral life cycle GO:0019058 | 0.006728169413 |2.1721030816710 | 351 |308 20 |18092 
235880 30 
GO:BP | apoptotic cell clearance GO:0043277 | 0.007336883443 | 2.1344883802374 |53 |308 8 (18092 
483930 600 
GO:BP |sulfur compound catabolic GO:0044273 | 0.007336883443 | 2.1344883802374 |53 |308 8 (18092 
process 483930 600 
GO:BP | aromatic compound catabolic | GO:0019439 | 0.007508636574 | 2.1244389154448 |622 |308 28 |18092 
process 581980 20 
GO:BP | establishment of protein GO:0072594 | 0.008122198258 | 2.0903264138221 | 589 |308 27 |18092 
localization to organelle 48554 70 
GO:BP | gluconeogenesis GO:0006094 | 0.008400849623 | 2.0756767891763 | 91 308 10 [18092 
179330 30 
GO:BP | endoderm formation GO:0001706 | 0.008486707517 | 2.0712607649325 |54 — |308 8 (18092 
197090 900 
GO:BP | positive regulation by host of | 60:0044794 | 0.009035042832 | 2.0440697842344 |15 — |308 5 (18092 
viral process 509050 000 
GO:BP | diterpenoid metabolic proc GO:0016101 | 0.009059329775 | 2.0429039309920 |113 |308 11 [18092 
105290 900 
GO:BP | leukocyte migration GO:0050900 | 0.009180964282 | 2.0371117021680 | 523 |308 25 | 18092 
718370 10 
GO:BP | axonogenesis GO:0007409 | 0.009229712350 | 2.0348118338173 | 489 |308 24 |18092 
347430 700 
GO:BP | cell population proliferation GO:0008283 | 0.010322604325 | 1.9862107192403 |2091 |308 63 [18092 
246300 900 
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GO:BP | hexose biosynthetic process GO:0019319 | 0.011298676232 | 1.9469724360561 |94 |308 10 |18092 
041100 300 

GO:BP | oxoacid metabolic pro GO:0043436 | 0.011896038048 | 1.9245976554222 | 1060 |308 зә |18092 
677300 000 

СО:ВР | carbohydrate biosynthetic GO:0016051 | 0.012313727183 | 1.9096104726544 |217 |308 15 |18092 
process 072100 700 

GO:BP | developmental growth involved | GO:0060560 | 0.012370443592 | 1.9076147266923 | 245 |308 16 |18092 
in morphogenesis 68920 800 

СО:ВР | biological process involved in | GO:0051702 | 0.012440158956 | 1.9051740703449 | 95 308 10 |18092 
interaction with symbiont 051900 500 

GO:BP | acute inflammatory response GO:0002526 | 0.012729497004 | 1.8951887568087 | 117 |308 11 |18092 
03810 600 

GO:BP | positive regulation of heterotypic | GO:0034116 | 0.012959081575 | 1.8874257763135 |16 |308 5 18092 
cell-cell adhesion 304600 000 

GO:BP | complement activation, GO:0006957 | 0.012959081575 | 1.8874257763135 |16 |308 5 18092 
alternative pathway 304600 000 

GO:BP | axon development GO:0061564 | 0.013209381050 | 1.8791175315565 |534 |308 25 |18092 
731000 300 

GO:BP | regulation of cell morphogenesis | GO:0010769 | 0.013680220128 | 1.8639069143165 |96 |308 10 |18092 
involved in differentiation 75730 30 

GO:BP | regulation of cell cycle G2/M GO:1902749 — |0.013769635310 | 1.8610775619027 |219 |308 15 |18092 
phase transition 490200 800 

GO:BP | complement activation, classical | GO:0006958 | 0.014475911655 | 1.8393540760078 | 142 |308 12 |18092 
pathway 081000 800 

GO:BP | morphogenesis of an epithelium | GO:0002009 | 0.016365063780 | 1.7860822977585 |576 |308 26 |18092 
25140 30 

GO:BP |cell growth GO:0016049 | 0.016467010298 | 1.7833852428859 |506 |308 24 |18092 
65660 000 

GO:BP | establishment of protein GO:0090150 | 0.017249493479 | 1.7632236531858 |342 |308 19 |18092 
localization to membrane 549100 70 

GO:BP | response to vitamin GO:0033273 | 0.018064312383 | 1.7431785651606 |99 |308 10 |18092 
618000 70 

GO:BP | terpenoid metabolic process GO:0006721 | 0.020657249573 | 1.6849275034555 | 123 |308 11 |18092 
878100 600 

GO:BP | protein localization to GO:0070972 | 0.020673739946 | 1.6845809509959 | 147 |308 12 |18092 
endoplasmic reticulum 093000 800 

GO:BP | plasma membrane bounded cell | GO:0120039 | 0.020974452002 | 1.6783093771206 |694 |308 29 |18092 
projection morphogenesis 53790 100 

СО:ВР | negative regulation of response to | GO:0032102 | 0.022213485989 | 1.6533832817464 | 413 |308 21 |18092 
external stimulus 145200 000 

GO:BP | negative regulation of cellular | GO:0051271 | 0.022894545621 | 1.6402679713033 | 381 |308 20 [|18092 
component movement 969600 300 

GO:BP | cell projection morphogenesis | GO:0048858 | 0.023384877996 | 1.6310648915578 | 698 |308 29 |18092 
033200 200 

GO:BP | cellular component GO:0032989 | 0.023894068554 | 1.6217098946088 | 813 |308 32 | 18092 
morphogenesis 36900 600 

GO:BP | chylomicron remodeling GO:0034371 | 0.024258017719 | 1.6151446910435 | 9 308 4 18092 
751500 300 

GO:BP | heterotypic cell-cell adhesion GO:0034113 | 0.024509853590 | 1.6106592829979 | 62 308 8 18092 
406000 700 

GO:BP | blood coagulation, intrinsic GO:0007597 | 0.024717884641 | 1.6069886989540 | 18 308 5 18092 
pathway 193300 400 

GO:BP | high-density lipoprotein particle | GO:0034375 | 0.024717884641 | 1.6069886989540 | 18 308 5 18092 
remodeling 193300 400 

GO:BP | organic acid metabolic process | GO:0006082 | 0.024812003725 | 1.6053381622899 | 1094 | 308 зә [18092 
967000 700 

СО:ВР | positive regulation of endocytosis | GO:0045807 | 0.025764299921 | 1.5889816538835 | 103 [308 10 |18092 


09340 


500 
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GO:BP | protein targeting to membrane | GO:0006612 | 0.026172734786 | 1.5821508956027 |203 |308 14 |18092 
400400 900 

GO:BP | extracellular matrix disassembly | GO:0022617 | 0.026343977656 | 1.5793186506408 | 82 308 9 18092 
476100 700 

GO:BP | growth GO:0040007 | 0.026782127878 | 1.5721549206393 | 1016 | 308 37 | 18092 
126900 100 

СО:ВР | response to organic cyclic GO:0014070 | 0.028221859992 | 1.5494143669931 |978 |308 36 |18092 
compound 0619 000 

GO:BP | protein-containing complex GO:0043933 | 0.028654926465 | 1.5428007016878 | 1974 |308 59 |18092 
subunit organization 956100 400 

GO:BP | regulation of response to external | GO:0032101 | 0.028749151011 | 1.5413749758965 | 1060 |308 38 |18092 
stimulus 38610 900 

GO:BP | endothelial cell migration GO:0043542 | 0.030104082756 | 1.5213746008032 |293 |308 17 |18092 
663800 30 

GO:BP | animal organ morphogenesis GO:0009887 | 0.031855506621 | 1.4968154836261 | 1106 | 308 39 |18092 
27800 80 

GO:BP | regulation of response to stress | GO:0080134 | 0.038031085788 | 1.4198612752941 | 1455 |308 47 | 18092 
35740 600 

GO:BP | cell part morphogenesis GO:0032990 | 0.038683299653 | 1.4124764880513 |717 |308 29 |18092 
24820 300 

GO:BP | glycosaminoglycan biosynthetic | GO:0006024 | 0.039241071244 | 1.4062591453027 | 108 |308 10 |18092 
process 031000 600 

GO:BP | neuron projection development | GO:0031175 | 0.039415353392 | 1.4043345752584 | 1034 | 308 37 | 18092 
69590 000 

GO:BP | neuron projection morphogenesis | GO:0048812 | 0.039497662924 | 1.4034286008098 | 680 | зов 28 |18092 
01160 200 

GO:BP | bone trabecula formation GO:0060346 | 0.039887860829 | 1.3991592541134 |10 |308 4 18092 
20470 100 

GO:BP | regulation of intracellular signal | GO:1902531 | 0.043322860168 | 1.3632828790475 | 1864 |308 56 [18092 
transduction 34470 200 

СО:ВР | modulation by host of viral GO:0044788 | 0.043582871082 | 1.3606841633936 | 33 308 6 18092 
process 32300 000 

GO:BP | plasma lipoprotein particle GO:0034369 | 0.043582871082 | 1.3606841633936 | 33 308 6 18092 
remodeling 32300 000 

GO:BP | protein-lipid complex remodeling | GO:0034368 | 0.043582871082 | 1.3606841633936 | 33 308 6 18092 
32300 000 

GO:BP | regulation of protein processing | GO:0070613 | 0.043924289618 | 1.3572952535657 | 67 308 8 18092 
008900 00 

GO:BP | negative regulation of signal GO:0009968 | 0.044448693797 | 1.3521409970085 | 1377 | 308 45 |18092 
transduction 45640 500 

GO:BP | tissue homeostasis GO:0001894 | 0.045902773390 | 1.3381610741139 |272 |308 16 |18092 
9766 300 

GO:BP | cellular response to oxygen- GO:1901701 | 0.046604555311 | 1.3315716315931 | 1251 |308 42 |18092 
containing compound 604600 700 

GO:BP | regulation of receptor-mediated | GO:0048259 | 0.049938933236 | 1.3015607390049 | 111 |308 10 |18092 
endocytosis 62680 900 

GO:CC | extracellular exosome GO:0070062 | 7.677610029782. | 138.11477395085 | 2180 |308 223 |18963 
28E-139 60 

GO:CC | extracellular vesicle GO:1903561 | 9.416473618674 | 137.02611170598 | 2266 | 308 225 | 18963 
87E-138 400 

GO:CC | extracellular organelle GO:0043230 | 1.149086414652 | 136.93964730987 | 2268 |308 225 |18963 
7Е-137 10 

GO:CC | extracellular space GO:0005615 | 6.701719866510 | 129.17381372973 | 3598 |308 255 |18963 
58Е-130 400 

GO:CC | extracellular region GO:0005576 | 1.786238614591 | 121.74806052636 | 4602 |308 270 |18963 
79Е-122 700 

GO:CC | vesicle GO:0031982 | 1.795079519710 | 97.745916307973 | 4071 |308 240 |18963 


53E-98 


40 


279 


GO:CC | collagen-containing extracellular | GO:0062023 | 5.144066050655 | 63.288693463635 | 424 |308 83 | 18963 
matrix 23E-64 40 
GO:CC | extracellular matrix GO:0031012 | 5.818054722958 |56.235222198220 |564 |308 86 |18963 
06Е-57 100 
GO:CC | blood microparticle GO:0072562 | 4.091195817991 | 44.388149733252 | 141 |308 46 |18963 
Е-45 10 
GO:CC | vesicle lumen GO:0031983 | 1.249225603205 | 43.903359123371 |326 |308 61 |18963 
03E-44 800 
GO:CC | secretory granule lumen GO:0034774 | 6.723026936890 | 43.172435148671 |320 |308 60 |18963 
22E-44 
GO:CC | cytoplasmic vesicle lumen GO:0060205 | 1.453175557724 | 42.837681915539 |324 |308 60 |18963 
77E-43 90 
GO:CC | secretory granule GO:0030141 |7.641382578812. | 31.116828056080 | 855 |308 75 |18963 
3E-32 70 
GO:CC | endoplasmic reticulum lumen |GO:0005788 | 1.314560075406 | 28.881219561842 |307 |308 47 |18963 
18E-29 700 
GO:CC | cell-substrate junction GO:0030055 | 4.919365583742 | 28.308090901550 |426 |308 53 |18963 
43E-29 80 
GO:CC | secretory vesicle GO:0099503 | 6.283557782626 | 28.201794386859 | 1028 |308 78 | 18963 
17E-29 800 
GO:CC | focal adhesion GO:0005925 | 1.946378646224 |27.710772668708 | 418 |308 52 |18963 
71Е-28 20 
GO:CC | cytoplasmic vesicle GO:0031410 | 4.781349656298 | 24.320449495532 | 2437 | 308 114 |18963 
14E-25 800 
GO:CC | intracellular v GO:0097708 | 5.526367613173 |24.257560229252 |2441 |308 114 |18963 
84E-25 50 
GO:CC |endomembrane system GO:0012505 | 3.280515357635 |22.484057924765 | 4633 |308 160 |18963 
49E-23 200 
GO:CC |anchoring junction GO:0070161 | 2.469791727890 | 21.607339668295 | 843 |308 63 |18963 
528-22 600 
GO:CC |membrane-bounded organelle | GO:0043227 | 1.165817182271 | 19.933369548166 | 12836 | 308 281 |18963 
5Е-20 800 
GO:CC | platelet alpha granule lumen GO:0031093 | 1.847378586893 | 18.733444094624 |66 |308 2 |18963 
26Е-19 6 
GO:CC | cytoplasm GO:0005737 | 3.272866013785 | 17.485071773691 | 11888 |308 266 |18963 
69Е-18 300 
GO:CC | platelet alpha granule GO:0031091 | 1.199170418407 | 16.921119093369 |90 |308 22 | 18963 
118-17 40 
GO:CC | ficolin-1-rich granule lumen GO:1904813 | 8.378078052462 | 16.076855597950 | 124 |308 24 |18963 
84E-17 000 
GO:CC | cell junction GO:0030054 | 3.826589643954 | 14.417188108175 |2131 |308 89 [18963 
098-15 700 
GO:CC | organelle GO:0043226 | 2.277604161078 | 13.642521752424 | 13974 |308 284 |18963 
28E-14 300 
GO:CC | basement membrane GO:0005604 | 8.075554398845 | 13.092827653009 | 103 |308 20 [18963 
75Е-14 90 
GO:CC | cytosol GO:0005829 | 2.094790777964 | 12.678859346689 | 5231 |308 151 |18963 
35Е-13 600 
GO:CC | vacuolar lumen GO:0005775 | 2.821050074050 | 12.549589204985 |174 |308 24 |18963 
45Е-13 800 
GO:CC | ficolin-1-rich granule GO:0101002 | 1.024981524796 | 11.989283962658 | 184 |308 24 |18963 
22E-12 30 
GO:CC | melanosome GO:0042470 | 1.800377156889 |11.744636506008 | 105 |308 1 |18963 
74E-12 90 
GO:CC | pigment granule GO:0048770 | 1.800377156889 |11.744636506008 | 105 |308 1 |18963 
74Е-12 90 
GO:CC | collagen trimer GO:0005581 | 4.060689117657 | 11.391400258393 | 95 308 18 |18963 


64E-12 


200 


280 


GO:CC | membrane-enclosed lumen GO:0031974 | 5.047159697846 | 11.296952953479 | 5468 |308 152 |18963 
18E-12 900 

GO:CC_| organelle lumen GO:0043233 | 5.047159697846 | 11.296952953479 | 5468 | 308 152 |18963 
18E-12 900 

GO:CC | endoplasmic reticulum GO:0005783 | 5.103424518268 |11.202138304238 | 1975 |308 79 |18963 
72Е-12 200 

GO:CC | сей surface GO:0009986 | 1.482128071390 | 10.829114267142 |922 |308 50 |18963 
55Е-11 80 

GO:CC | lysosome GO:0005764 | 2.542812714684 | 10.594685625657 | 709 |308 аз |18963 
46E-11 7 

GO:CC_| lytic vacuole GO:0000323 | 2.542812714684 | 10.594685625657 | 709 |308 аз | 18963 
46E-11 7 

GO:CC | supramolecular polymer GO:0099081 | 1.465054598939 | 9.8341461899335 | 1014 |308 51 |18963 
72Е-10 00 

GO:CC | supramolecular fiber GO:0099512 | 3.998765673176 | 9.3980740446857 | 1006 | 308 50 |18963 
05Е-10 60 

СО:СС | vacuole GO:0005773 | 4.945690751127 | 9.3057730431056 | 807 |308 44 |18963 
73E-10 70 

GO:CC | protein-containing complex GO:0032991 | 6.551906550572 |9.1836323055181 | 5556 |308 148 |18963 
67E-10 00 

GO:CC | supramolecular complex GO:0099080 | 1.388071561324 | 8.8575881434726 | 1337 | 308 58 | 18963 
21E-09 

GO:CC | endocytic vesicle lumen GO:0071682 | 3.009987583772 | 8.5214352958712 |20 |308 9 18963 
84E-09 90 

GO:CC GO:0030139 | 1.046145632059 | 6.9804078539032 |313 |308 24 |18963 
89Е-07 300 

GO:CC | fibrinogen complex GO:0005577 | 1.426916679480 | 6.8456013854686 | 8 308 6 18963 
52E-07 

СО:СС | complex of collagen trimers GO:0098644 | 2.243127080312 | 6.6491461215507 | 21 308 8 18963 
62E-07 20 

GO:CC | platelet dense granule lumen GO:0031089 | 2.596892771551 | 6.5855459824789 |14 |308 7 18963 
19E-07 20 

GO:CC | perinuclear region of cytoplasm | GO:0048471 | 0.000001349743 | 5.8697486046274 | 740 |308 36 |18963 
9682645700 30 

GO:CC | lysosomal lumen GO:0043202 | 0.000001749294 | 5.7571369740831 | 97 308 1 |18963 
8836575400 20 

GO:CC | сей cortex GO:0005938 | 0.000003858851 | 5.4135419485337 |318 |308 22 | 18963 
371228060 00 

СО:СС | fibrillar collagen trimer GO:0005583 | 0.000004457060 | 5.3509514944206 | 12 308 6 18963 
2573208100 60 

GO:CC | banded collagen fibril GO:0098643 | 0.000004457060 | 5.3509514944206 | 12 308 6 18963 
2573208100 60 

GO:CC | platelet dense granule GO:0042827 | 0.000007979099 | 5.0980461344156 | 21 308 7 18963 
220573580 20 

GO:CC | azurophil granule GO:0042582 | 0.000009944418 | 5.0024206116968 | 154 |308 15 |18963 
39640148 8 

GO:CC | primary lysosome GO:0005766 | 0.000009944418 | 5.0024206116968 | 154 |308 15 |18963 
39640148 8 

GO:CC | synapse GO:0045202 | 0.000021378588 | 4.6700209642773 | 1382 |308 50 |18963 
88217740 30 

GO:CC | cytosolic ribosome GO:0022626 | 0.000062632838 | 4.2031979081231 | 109 |308 12 |18963 
16021780 600 

GO:CC_ | cell-cell junction GO:0005911 | 0.000075344313 | 4.1229495213794 | 506 |308 26 |18963 
23337430 40 

GO:CC | azurophil granule lumen GO:0035578 | 0.000077571618 |4.1102971452202 | 91 308 11 |18963 
83308630 4 

СО:СС | intracellular anatomical structure | GO:0005622 | 0.000081102442 | 4.0909660637546 | 14764 | 308 274 |18963 
9712922 60 
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GO:CC | cortical cytoskeleton GO:0030863 | 0.000093222697 | 4.0304783322456 | 113 |308 12 |18963 
97083950 10 

GO:CC | high-density lipoprotein particle | GO:0034364 | 0.000095793292 | 4.0186649012351 |29 |308 7 18963 
09517390 00 

GO:CC | haptoglobin-hemoglobin complex | GO:0031838 | 0.000139721796 | 3.8547358383442 |11 308 5 18963 
70235400 90 

GO:CC | chylomicron GO:0042627 | 0.000581646366 | 3.2353409803856 |14 |308 5 18963 
7419890 900 

GO:CC | plasma lipoprotein particle GO:0034358 | 0.000683341924 | 3.1653619334197 | 38 308 7 18963 
716555 100 

GO:CC | lipoprotein particle GO:1990777 | 0.000683341924 | 3.1653619334197 | 38 308 7 18963 
716555 100 

GO:CC | membrane attack complex GO:0005579 | 0.000688221946 | 3.1622714822276 | 7 308 4 18963 
7886950 600 

GO:CC | sarcolemma GO:0042383 | 0.000800443235 | 3.0966694611983 | 138 |308 12 |18963 
5496520 50 

GO:CC | intracellular organelle lumen GO:0070013 | 0.001002081324 | 2.9990970315855 | 5304 |308 123 |18963 
5658900 300 

GO:CC | actin cytoskeleton GO:0015629 | 0.001135937521 | 2.9446455548802 |513 |308 24 |18963 
5645400 400 

GO:CC | protein-lipid complex GO:0032994 | 0.001167572584 | 2.9327161109914 |41 308 7 18963 
8777400 000 

GO:CC | collagen beaded filament GO:0098647 | 0.001270608119 | 2.8959883738651 | 3 308 3 18963 
1421600 000 

GO:CC | collagen type VI trimer GO:0005589 | 0.001270608119 | 2.8959883738651 | 3 308 3 18963 
1421600 000 

GO:CC | laminin-10 complex GO:0043259 | 0.001270608119 | 2.8959883738651 | 3 308 3 18963 
1421600 000 

GO:CC | Golgi lumen GO:0005796 | 0.001437727107 | 2.8423235389539 |99 |308 10 |18963 
045800 10 

GO:CC | cell projection GO:0042995 | 0.001565430452 | 2.8053662219961 | 2352 |308 66 |18963 
7051900 50 

GO:CC | intracellular non-membrane- GO:0043232 | 0.001947670762 | 2.7104844550338 | 4326 |308 104 |18963 
bounded organelle 8574700 00 

GO:CC | non-membrane-bounded GO:0043228 | 0.002098378762 | 2.6781161179354 | 4333 |308 104 |18963 
organelle 088650 20 

СО:СС |podosome GO:0002102 | 0.002238247611 | 2.6500918702360 |30 |308 6 18963 
7435100 00 

GO:CC | сей periphery GO:0071944 | 0.002381677486 | 2.6231170486355 | 5786 |308 130 |18963 
3688900 20 

GO:CC | cortical actin cytoskeleton GO:0030864 | 0.002402563135 | 2.6193251910380 | 83 308 9 18963 
00242 200 

СО:СС | cytoskeleton GO:0005856 | 0.003052102064 | 2.5154009473935 | 2302 |308 64 |18963 
2134700 800 

GO:CC | proteasome complex GO:0000502 | 0.003357098929 | 2.4740358604571 |66 |308 8 18963 
4374000 70 

GO:CC | glutamatergic synapse GO:0098978 | 0.003667894489 | 2.4355831657613 |373 |308 19 |18963 
1447400 60 

GO:CC | triglyceride-rich plasma GO:0034385 | 0.005384514080 | 2.2688534830105 | 21 308 5 18963 
lipoprotein particle 635100 100 

GO:CC | very-low-density lipoprotein GO:0034361 | 0.005384514080 | 2.2688534830105 | 21 308 5 18963 
particle 635100 100 

GO:CC | external side of plasma GO:0009897 | 0.005759980368 | 2.2395789967410 |420 |308 20 [18963 
membrane 777480 500 

GO:CC | plasma membrane bounded cell | GO:0120025 | 0.006127708882 | 2.2127018755321 |2255 |308 62 |18963 
projection 20211 800 

GO:CC | endopeptidase complex GO:1905369 | 0.006430538531 | 2.1917526551853 | 72 308 8 18963 


0176400 


10 


282 


GO:CC | ribonucleoprotein complex GO:1990904 | 0.007862004323 | 2.1044667216723 | 693 |308 27 |18963 
767770 800 

GO:CC | lamellipodium GO:0030027 | 0.009175311632 | 2.0373791763326 | 204 |308 1з | 18963 
713730 000 

GO:CC | cytosolic large ribosomal subunit | GO:0022625 | 0.010996018085 | 1.9587645545384 |57 |308 7 | 18963 
288300 200 

СО:СС | side of membrane GO:0098552 | 0.012329032089 | 1.9090710170642 | 632 |308 25 | 18963 
182800 00 

GO:CC | intermediate-density lipoprotein | GO:0034363 | 0.012401453117 | 1.9065274242034 | 5 308 3 | 18963 
particle 334000 600 

GO:CC | hemoglobin complex GO:0005833 | 0.013014208507 | 1.8855822394985 |1з | 308 4 |18963 
106100 700 

GO:CC | ribosomal subunit GO:0044391 | 0.021192676492 | 1.6738141913548 | 191 |308 12 | 18963 
42720 900 

GO:CC | cell leading edge GO:0031252 | 0.024569257259 | 1.6096079722502 | 429 |308 19 | 18963 
66830 100 

GO:CC | plasma membrane GO:0005886 | 0.027593136851 | 1.5591989251220 | 5663 | 308 123 | 18963 
04550 800 

GO:CC | cytosolic small ribosomal subunit | GO:0022627 | 0.032101006573 | 1.4934813494560 | 47 |308 6 [18963 
04560 100 

GO:CC | microvillus GO:0005902 | 0.037786483428 | 1.4226635235001 |92 |308 8 | 18963 
605900 500 

СО:СС | adherens junction GO:0005912 | 0.042085202183 | 1.3758705820699 | 175 |308 11 | 18963 
35800 800 

KEGG | ECM-receptor interaction KEGG:04512 | 5.095553980262 | 20.292808593307 |88 |218 28 |17987 
86E-21 700 

KEGG | Focal adhesion KEGG:04510 | 3.657712750637 | 16.436790403756 | 200 |218 35 | 7987 
19E-17 20 

KEGG | Complement and coagulation KEGG:04610 | 3.867591358592 | 13.412559418788 |85 |218 22 | 7987 
cascades 86E-14 500 

KEGG | PI3K-Akt signaling pathway KEGG:04151 | 1.880103221876 | 8.7258183063422 |353 |218 35 |7987 
95Е-09 0% 

KEGG | Human papillomavirus infection | KEGG:05165 | 6.589188751555 | 8.1811680516291 |331 |218 зз |7987 
77E-09 2 

KEGG | Amocbiasis KEGG:05146 | 1.490842646867 | 6.8265681923641 |101 |218 17 |7987 
75E-07 20 

KEGG  |Glycolysis/Gluconeogenesis | KEGG:00010 | 2.332572189791 | 6.6321649065770 |67 |218 14 |7987 
12E-07 50 

КЕСС | Phagosome KEGG:04145 | 2.81159108956Е | 6.5510478417108 | 148 |218 20 |7987 
-07 90 

KEGG | Coronavirus disease - COVID-19 | KEGG:05171 | 0.000001481598 | 5.8292694315496 |231 |218 24 |7987 
6323689400 30 

KEGG | Proteoglycans in cancer KEGG:05205 | 0.000070586972 | 4.1512754420868 |205 |218 20 |7987 
93243770 50 

KEGG | Platelet activation KEGG:04611 | 0.000125596238 | 3.9010233654448 | 124 |218 15 | 7987 
9938840 30 

KEGG | Carbon metabolism KEGG:01200 | 0.000362764229 | 3.4403755426931 | 118 |218 14 | 7987 
97977000 800 

KEGG | Protein digestion and absorption | KEGG:04974 | 0.000406995798 | 3.3904100736140 |103 |218 1з | 7987 
91927700 700 

KEGG | Biosynthesis of amino acids KEGG:01230 | 0.000524160951 | 3.2805353359495 |75 |218 " [7987 
4256820 500 

KEGG | HIF-1 signaling pathway KEGG:04066 | 0.003918467746 | 2.4068837236720 | 109 |218 12 | 7987 
074300 200 

KEGG | Salmonella infection KEGG:05132 | 0.004769796699 | 2.3215001312693 |249 |218 19 |7987 
479230 100 

КЕСС | Prion disease KEGG:05020 | 0.005350386815 |2.2716148187961 |273 |218 20 |7987 
183960 300 
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KEGG | Systemic lupus erythematosus KEGG:05322 | 0.006592866409 | 2.1809257242520 | 133 218 13 7987 
1156700 10 

KEGG | Regulation of actin cytoskeleton | KEGG:04810 | 0.008732623040 | 2.0588552865631 | 217 218 17 7987 
141890 20 

KEGG | Pentose phosphate pathway KEGG:00030 | 0.013506770983 | 1.8694484638218 | 30 218 6 7987 
822200 9 

КЕСС | Staphylococcus aureus infection | KEGG:05150 | 0.017734671924 | 1.7511768411874 | 91 218 10 7987 
015100 00 

КЕСС | Malaria KEGG:05144 | 0.033865409705 | 1.4702436659032 | 49 218 T 7987 
90260 900 

KEGG | Cholesterol metabolism KEGG:04979 | 0.038478904782 | 1.4147772977004 | 50 218 7 7987 
172900 400 


Table Š 8. List of proteins candidate biomarkers identified іп A549 cell-derived exosomal proteome found at least twice with 
their Sum of PEP score. 


A549 exo Description # of replicates Sum of PEP score 
| proteins 

000468-5 Isoform 5 of Agrin 2 23.128 
000560-1 Syntenin-1 2 21.5605 
(043795 Unconventional myosin-Ib 2 35.1345 
P02786 Transferrin receptor protein 1 2 19.842 
P05023 Sodium/potassium-transporting ATPase subunit alpha-1 2 27.0535 
P06576 ATP synthase subunit beta, mitochondrial 2 36.7945 
P07355 Annexin A2 2 67.3405 
P08195 4F2 cell-surface antigen heavy chain 2 86.061 
P10809 60 kDa heat shock protein, mitochondrial 2 59.013 
P16401 Histone H1.5 2 23.368 
P25705-1 ATP synthase subunit alpha, mitochondrial 2 26.285 
Р26006 Integrin alpha-3 2 29.465 
P29401 Transketolase 2 19.679 
P40926 Malate dehydrogenase, mitochondrial 2 36.5445 
Р54652 Heat shock-related 70 kDa protein 2 2 25.768 
P62701 40S ribosomal protein S4, X isoform 2 15.7245 
P68104 Elongation factor 1-а1рһа 1 2 25.314 
Q00610-1 Clathrin heavy chain 1 2 85.869 
Q07020 60S ribosomal protein L18 2 19.4515 
(013885 Tubulin beta-2A chain £ 43.28 
Q7L7L0 Histone H2A type 3 2 21.9595 
Q9NZM1 Tubulin beta-1 chain 2 48.0405 
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Table $ 9. Protein concentration of the exosome samples (E- breath-derived exosomes from a single individual, WHB 
whole breath sample from a single individual, S- breath-derived exosomes collected from volunteers) in the vial. 


Sample name Peptide quant (ng/uL) V total (uL) Total protein (ng) Conc (mg/mL) 
EI 87.8 20 1756 0.0878 
E2 99.7 20 1994 0.0997 
E3 108 18 1944 0.108 
Е4 98.6 20 1972 0.0986 

WHBI 36.94 45 1662.3 0.03694 
WHB2 36.551 45 1644.795 0.036551 
WHB3 21.53 45 968.85 0.02153 
WHB4 21.815 45 981.675 0.021815 
51 21.873 45 984.285 0.021873 
52 18.37 45 826.65 0.01837 
S3 13.799 20 275.98 0.013799 


Table S 10. List of proteins found at least once with a high quality peptide match identified in breath-derived exosomes 


collected from volunteers. 


Protein ID Protein ID Protein ID Protein ID Protein ID Protein ID 
A0A075B6H7 095498 P06870 P22392 P60660 QSJTV8 
A0A075B6H9 095716 P07195 P22735 P60842 Q5QNW6 
А0А075В616 Р00338 Р07237 P22894 P60900 QS5SSGS8 
А0А075В619 Р00450 Р07339 Р23246 Р60953 0557К8 
A0A075B6J9 P00491 P07355 P23280 P61019 Q5T749 
A0A075B6K4 P00558 P07358 P23284 P61020 057750 
A0A075B6S5 P00734 P07360 P23381 P61026 QSTEC6 
A0A075B7B8 P00736 P07437 P23526 P61106 QSVT79 
А0А075В700 Р00738 Р07476 P23528 Р61158 Q5VVJ2 
A0A087WSY6 P00739 P07602 P24158 P61160 Q5VZK9 
A0A087WUI2 P00747 P07737 P24534 P61224 Q68DW7 
A0A087X0Q4 P00751 P07858 P25311 P61247 Q6A163 
А0А0А0М071 Р01008 Р07864 P25705 Р61313 Q6FSE8 
A0A0A0MRZ8 P01009 P07900 P26038 P61626 Q6MZM9 
A0A0A0MS15 Р01011 Р07996 Р26447 Р61769 Q6P4A8 
A0A0A0MT36 P01023 P08123 P26583 P61978 Q6P5S2 
A0A0B4J1U3 P01024 P08133 P26641 P61981 Q6PCB0 
A0A0B4J1U7 P01033 P08174 P27105 P62081 Q6UWP8 
AOAOB4J1VO Р01034 Р08238 P27348 P62158 Q6XQN6 
А0А0В4ЛХ5 Р01036 Р08246 P27482 P62241 Q6YP21 
AOAOB4J1Y9 Р01037 Р08294 Р27487 Р62269 062514 
A0A0C4DGG8 P01040 P08311 P27797 P62328 О70199 


285 


A0A0C4DH25 P01042 P08493 P27918 P62491 Q71U36 
A0A0C4DH29 P01591 P08571 P28062 P62701 Q75VX8 
A0A0C4DH31 P01602 P08575 P28066 P62714 Q7RTR2 
A0A0C4DH35 P01614 P08582 P28325 P62753 О77569 
A0A0C4DH38 P01619 P08603 P28676 P62805 0727794 
А0А0С4рН55 Р01624 Р08637 Р28799 Р62820 Q7Z7M9 
AO0A0CADH68 P01699 P08670 P29350 P62826 Q86UX7 
A0A0C4DH72 P01700 P08758 P29373 P62857 Q86V81 
A0A0G2JMB2 P01701 P08779 P29401 P62873 Q86VR7 
A0A0G2JPRO P01703 Р08865 Р29508 Р62937 Q86VZ4 
A0A0G2JRQ6 P01706 P09211 P30040 P62987 Q86X29 
A0A0J9YY99 P01709 P09228 P30041 P63104 086ХІ8 
A0A140TA29 P01714 P09237 P30044 P63241 Q86Y34 
AOAIX7SBS1 Р01718 Р09382 Р30086 Р67775 Q86Y46 
AO0A2Q2TTZ9 P01721 P09429 P30153 P68104 Q86YZ3 
AOA2R8Y5S7 P01762 P09493 P30273 P68133 QSIWL2 
AOA2R8YEA7 Р01763 Р09958 P30501 P68366 081783 
АОАЗВЗІКМ5 Р01766 Р09972 Р30520 Р68371 QSIZPO 
A0M8Q6 P01768 P0CG39 P30626 P68871 Q8N0Y7 
A8K2U0 P01782 PODJI8 P30740 P69905 Q8NIN4 
A8MTF8 Р01833 PODJI9 P30838 P78371 Q8N4F0 
ARMUUI P01834 PODMV8 P31025 P80188 Q8NBB4 
ARMVUI P01857 PODOX7 P31146 P80511 Q8NBJA 
B3EWG6 Р01859 PODOY2 P31150 P80723 Q8NEL9 
B4DEH5 P01860 PODOY3 P31151 P80748 Q8TAA3 
B9A064 P01861 PODPOI P31943 P81605 Q8TAX7 
C9IZR8 P01871 PODP04 P31944 P84243 Q8TCU6 
C9J7B7 P01876 P10153 P31946 P98160 Q8TDLS 
C9J8J7 P02042 P10599 P31947 Q00610 Q8TEA7 
C9JX95 P02452 P10606 P31949 Q01469 О8ТЕ09 
D3DSM0 P02533 P10809 P31997 Q01518 Q8WUM4 
D6RF35 P02538 P10909 P32119 Q01546 Q92743 
E9PDK7 P02545 P11021 P32320 001968 092817 
E9PKU7 P02647 P11142 P32926 Q02413 Q92876 
E9PLE9 P02649 P11215 P33241 Q02487 Q92882 
E9PLZ3 P02652 P11413 P34096 Q02543 Q92888 
F5GXS2 P02654 P12109 P34931 Q02809 Q92896 
G3VIRI P02656 P12236 P35268 Q02818 Q96AE4 
G3VIR6 P02671 P12273 P35527 Q03591 096В01 
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G5EA09 P02675 P12429 P35579 Q04446 Q96C03 
H0Y4V9 P02679 P12724 P35754 Q04609 Q96DA0 
H0Y8Z5 P02748 P12814 P35858 Q04695 Q96DR5 
HOYAL9 P02749 Р13639 P35908 Q04917 Q96DR8 
HOYERO Р02751 Р13645 P36578 005315 096607 
НОҮЕЗ2 Р02760 P13646 36952 Q05639 Q96HE7 
HOYGX7 P02763 P13647 P37802 Q06830 Q96P63 
H0YJW5 P02765 P13667 P37837 Q07020 Q96QB1 
H3BSZ8 P02766 P13716 P38606 007065 Q96QR1 
H7BZJ3 P02768 P13796 P39019 Q07654 Q99439 
Н7С169 Р02787 Р13797 Р39060 007955 099497 
J3QLF6 P02788 P13804 P39687 Q07960 099519 
150537 Р02790 Р13929 Р40121 008188 099536 
K7ELHS 02794 P13987 Р40197 008380 099715 
K7EPJ3 P02808 P14174 P40227 Q08431 Q99829 
K7EQX2 P02810 P14314 P40394 Q08554 Q99935 
M0QY96 P02812 P14317 P40926 008,18 Q9BRA2 
M0R251 P02814 P14618 P41218 Q09666 Q9BRK5 
O00160 P03973 P14625 P41240 Q0VD83 Q9BVC6 
O00299 P04004 P14780 P42167 011201 Q9BW30 
O00391 P04040 P14854 P43490 Q12931 Q9BXJ4 
O00592 P04066 P14923 P46783 Q13043 Q9GZM7 
O00602 P04075 P15104 P46940 013126 090228 
000748 P04080 P15153 Р47755 013162 09Н0Е2 
014647 Р04083 P15311 P47929 013217 Q9H173 
014727 Р04114 Р15328 р48382 013231 Q9HIEI 
014745 Р04196 Р15515 Р48643 013421 Q9H299 
014773 P04217 Р15516 Р49189 013438 Q9H2K2 
O14950 P04259 P15814 P49368 013561 Q9H4G4 
O14983 P04264 P15924 P49458 013748 09Н4М9 
O15131 P04350 P15941 P49788 013813 О9Н7Е2 
015143 Р04406 Р16070 P49913 Q13835 Q9HC84 
O15144 P04430 P16401 р50395 014005 Q9HCY8 
O15145 P04432 P16402 р50552 014103 Q9HD89 
O15511 P04632 P16870 PS50570 014118 093038 
043240 P04745 Р17213 Р50990 014134 Q9NYL9 
O43296 P04792 Р17858 P50991 014254 Q9NZTI 
043399 Р04839 Р17931 P50995 014508 Q9UBC9 
043490 Р04899 P17980 P51148 014515 Q9UBG3 
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043504 Р05089 Р17987 P51149 Q14520 Q9UBQO 
043866 Р05090 P18124 р51858 014624 Q9UBR2 
O60234 P05109 P18510 Р51888 014764 Q9UBX7 
O60343 P05155 P18669 PS51991 Q14847 Q9UEF7 
O60437 P05164 P19013 PS51993 QI4CN2 Q9UGM3 
060565 Р05166 Р19823 р52209 015080 09019 
060664 P05204 P19878 р52565 015084 090706 
060701 Р05387 Р19957 P52790 015149 Q9UKGI 
060812 Р06310 Р19971 P52907 015365 Q9UKRO 
060814 P06312 P20061 р53634 015517 Q9UKR3 
O75015 P06331 P20160 PS54108 Q15691 Q9UL52 
O75083 P06396 P20618 PS5000 Q15847 Q9UQ07 
O75367 P06576 P20742 PS55058 Q15904 Q9Y272 
O75531 P06702 P20810 PS55072 Q15942 Q9Y2J8 
O75594 P06703 P20930 PS55268 Q16610 Q9Y490 
O75874 P06727 P21281 PS55786 Q16629 Q9Y4L1 
O75888 P06731 P21333 PS8107 Q16777 Q9Y653 
O75955 P06733 P21926 Р59665 016851 Q9Y6R7 
O94985 P06737 P22061 PS59998 Q2I0M4 S4R3R4 
O95171 P06744 P22079 P60022 Q32P51 S4R460 
O95274 P06748 P22307 P60033 Q562R1 

O95445 P06753 P22314 P60174 Q5D862 


Table S 11. Raw list of proteins found at least twice with a high quality peptide match identified as core breath proteome 


(297). 
Protein ID Protein ID Protein ID Protein ID Protein ID Protein ID 
A0A075B6H7 P01699 P06576 P14780 P35579 Q08188 
A0A075B619 P01700 P06702 P14923 P35908 Q08380 
A0A0A0MRZ8 P01701 P06703 P15153 P37837 Q09666 
A0A0A0MS15 P01714 P06733 P15311 P38606 011201 
A0A0B4J1V0 P01721 P06737 Р15515 Р40121 013043 
А0АОВ4Л Х5 Р01833 P06744 P15516 P41218 013162 
A0A0B4J1Y9 P01834 P06753 P15924 P47929 Q13231 
A0A0C4DH25 Р01859 Р06870 Р16401 Р49913 013438 
A0A0C4DH31 P01860 P07195 P16402 P50990 Q13835 
A0A0G2JMB2 P01861 P07237 P16870 Р51149 014103 
A0A2Q2TTZ9 Р01871 Р07339 Р18669 Р52209 014508 
A0A2R8YEA7 P01876 P07355 P19013 P52790 Q14515 
А0М806 P02533 P07437 P19823 Р53634 014764 
A8K2U0 P02538 P07737 P19971 P54108 Q15904 
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B4DEH5 P02647 P07900 P20061 P55000 016610 
В9А064 Р02671 Р07996 Р20160 Р59665 016777 
G3VIRI P02675 P08133 P21333 P59998 Q210M4 
H7BZJ3 P02679 P08238 P21926 P60022 032Р51 
000160 Р02749 Р08246 Р22079 P60660 Q562R1 
000299 P02751 Р08493 Р22314 Р60842 Q5QNW6 
000391 Р02760 Р08571 Р23280 Р61158 057749 
014773 Р02768 Р08670 Р23381 Р61160 Q6MZM9 
014950 Р02787 Р08779 P23528 Р61224 06Р552 
015143 Р02788 Р09211 Р24158 Р61626 Q6UWP8 
015144 Р02790 Р09228 Р25311 Р61769 971036 
015145 Р02808 Р09958 Р25705 Р61978 Q8NIN4 
015511 Р02810 PODJIS P27482 P62328 Q8N4F0 
O43240 P02812 PODJI9 P27487 P62805 Q8TAX7 
043490 Р02814 PODMV8 P28062 P62826 Q8TDL5 
O75083 P03973 PODOY3 P28066 P62937 Q8WUM4 
O75594 P04004 PODUB6 P28325 P62987 Q92743 
O75874 P04040 P10153 P28676 P63104 Q92817 
O75955 P04075 P10606 P28799 P68104 Q92882 
O95716 P04080 P10809 P29401 P68133 Q96DA0 
P00338 P04083 P10909 P30040 P68366 Q96DR5 
P00491 P04196 Р11021 Р30041 Р68371 Q96QR1 
Р00558 Р04259 Р11142 Р30153 Р68871 099536 
Р00734 P04264 P11413 P30740 P69905 Q9BRKS 
Р00747 Р04406 P12273 P31025 P78371 Q9HC84 
Р01008 Р04792 Р12429 P31146 P80188 Q9HD89 
P01009 P04839 P12724 P31151 P80748 Q9NZTI 
P01023 P04899 P13639 P31944 P81605 Q9UBG3 
P01024 P05089 P13645 P31946 Q01518 Q9UBX7 
P01034 P05090 P13646 P31947 Q02413 Q9UGM3 
P01036 Р05109 Р13647 Р31949 002818 09Ү490 
Р01037 Р05164 P13716 P32320 Q04695 Q9Y6R7 
P01040 P05204 P13796 P32926 Q05315 S4R460 
Р01591 Р06310 Р13987 Р33241 006830 

Р01614 Р06331 Р14174 Р34096 007065 

Р01619 Р06396 P14618 P35527 Q07654 
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Table S 12. Subset of core breath proteome found at least twice with a high quality peptide match consisting of 184 proteins. 


Protein ID Protein ID Protein ID Protein ID Protein ID Protein ID 
A0A075B6H7 P01036 P05204 P15515 P37837 Q14508 
A0A075B619 P01037 P06310 P15516 P38606 Q14515 
A0A0A0MRZ8 P01040 P06331 P16401 P40121 Q15904 
A0A0A0MS15 P01591 P06576 P16402 P47929 Q16777 
A0A0B4J1V0 P01614 P06703 P16870 P52209 Q210M4 
А0АОВАЛ Х5 Р01619 Р06733 Р19013 Р55000 Q32P51 
А0А0В4Л Ү9 Р01699 Р06753 Р19823 Р59998 056261 
А0А0САрН25 Р01700 Р06870 Р19971 P60022 QSQNW6 
A0A0C4DH31 P01701 P07195 P22079 P60660 Q5T749 
A0A0G2JMB2 Р01714 Р07237 P22314 P60842 Q6MZM9 
A0A2Q2TTZ9 Р01721 Р07737 P23280 P61158 06Р552 
A0A2R8YEA7 P01834 P08133 P23381 P61978 Q6UWP8 
А0М806 Р01859 Р08493 P23528 P62805 Q71U36 
A8K2U0 Р01860 Р08670 Р25311 Р62826 Q8NIN4 
B9A064 P01861 P08779 P25705 P62987 Q8N4F0 
G3VIRI P01871 P09228 P27482 P63104 Q8TAX7 
H7BZJ3 P01876 P09958 P27487 P68133 Q8TDL5 
O00160 P02533 PODJIS P28062 P69905 Q8WUM4 
O00299 P02538 PODJI9 P28325 P80748 Q92743 
O14773 P02760 PODOY3 P29401 P81605 Q92817 
O14950 P02790 PODUB6 P30040 Q02818 Q96DA0 
O15143 P02808 P10606 P30153 Q04695 Q96DR5 
O15144 P02810 P10809 P31025 Q05315 Q96QR1 
O15145 P02812 P11021 P31944 Q06830 Q9HC84 
O43240 P02814 Р11413 Р31946 Q07654 Q9UBG3 
O43490 P04004 P12273 P31947 Q08188 Q9UBX7 
O75955 P04083 P13645 P32926 Q09666 Q9UGM3 
P00338 P04259 P13646 P33241 Q11201 Q9Y6R7 
P00558 P04406 P13647 P34096 Q13043 S4R460 
P00734 P04792 P13796 P35527 Q13438 

P01008 P05090 P15311 P35908 Q14103 
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Table S 13. GO analysis of breath exosomal proteome (297 proteins). The analysis was performed by using g:Profiler 


software. 


*The list of proteins associated with each ontology is available on request deanna.shea@yuw.ac.nz 


Source Term name Term ID Adjusted p- | Negative logl Term Query | Inter Effectiv 
value Oof size sec e 
adjusted tion domain 
p-value size size 

СО:МЕ structural constituent of GO:0005200 3.56E-19 18.4491073 286 24 18679 
cytoskeleton 

GO:MF cell adhesion molecule GO:0050839 6.6NE-18 17.1775235 544 286 45 18679 
binding 

GO:MF antigen binding GO:0003823 1.39E-16 15.858446 163 286 26 18679 

GO:MF cadherin binding GO:0045296 6.00E-15 14.2220133 333 286 33 18679 

GO:MF endopeptidase inhibitor GO:0004866 1.22E-13 12.9120381 175 286 24 18679 
activity 

GO:MF peptidase inhibitor activity GO:0030414 2.68E-13 12.5712631 181 286 24 18679 

GO:MF peptidase regulator activity GO:0061134 3.43E-13 12.4648782 222 286 26 18679 

GO:MF endopeptidase regulator GO:0061135 5.71E-13 12.243628 187 286 24 18679 
activity 

GO:MF structural molecule activity GO:0005198 5.97E-13 12.2240176 701 286 44 18679 

GO:MF enzyme inhibitor activity GO:0004857 2.94E-11 10.5315229 391 286 31 18679 

GO:MF signaling receptor binding GO:0005102 2.14E-10 9.66993416 1551 286 63 18679 

СО:МЕ immunoglobulin receptor GO:0034987 2.73E-10 9.56398203 77 286 15 18679 
binding 

GO:MF protein-containing complex GO:0044877 6.19E-10 9.20827271 1304 286 56 18679 
binding 

GO:MF actin filament binding GO:0051015 6.46E-09 8.1900157 213 286 21 18679 

GO:MF identical protein binding GO:0042802 3.72Е-08 7.42940097 2063 286 70 18679 

СО:МЕ protease binding GO:0002020 7.52Е-08 7.12404684 131 286 16 18679 

СО:МЕ cysteine-type endopeptidase а О:0004869 1.36Е-07 6.86685915 52 286 11 18679 
inhibitor activity 

СО:МЕ actin binding GO:0003779 1.47E-07 6.83252306 449 286 28 18679 

GO:MF calcium-dependent protein GO:0048306 1.73E-07 6.76310209 83 286 13 18679 
binding 

GO:MF cytoskeletal protein binding GO:0008092 1.1828E-06 5.92710429 1001 286 42 18679 

GO:MF enzyme binding GO:0019899 3.2008E-06 5.49474041 2048 286 65 18679 

GO:MF serine-type endopeptidase GO:0004867 1.456E-05 4.83682839 98 286 12 18679 
inhibitor activity. 

GO:MF serine-type endopeptidase GO:0004252 1.978E-05 4.70377325 167 286 15 18679 
activity 

GO:MF antioxidant activity GO:0016209 3.92E-05 4.40670865 87 286 11 18679 

СО:МЕ peroxidase activity GO:0004601 4.3954E-05 4.3570064 53 286 9 18679 

GO:MF glycosaminoglycan binding GO:0005539 7.252Е-05 4,13954296 238 286 17 18679 

GO:MF serine-type peptidase activity GO:0008236 7.6602E-05 4.11575833 185 286 15 18679 

GO:MF oxidoreductase activity, acting | GO:0016684 8.4577E-05 4.07274748 57 286 9 18679 
on peroxide as acceptor 

GO:MF serine hydrolase activity GO:0017171 0.00010124 3.99466335 189 286 15 18679 

GO:MF carbohydrate derivative GO:0097367 0.00018875 3.72412397 2282 286 65 18679 
binding 

GO:MF lipopolysaccharide binding GO:0001530 0.00019499 3.70999876 32 286 7 18679 

GO:MF chaperone binding GO:0051087 0.00030091 3.52156904 106 286 11 18679 

СО:МЕ calcium ion binding GO:0005509 0.00033891 3.46991324 723 286 30 18679 

GO:MF protein folding chaperone GO:0044183 0.00045889 3.33828738 36 286 7 18679 

GO:MF unfolded protein binding GO:0051082 0.00057247 3.24224983 113 286 11 18679 

GO:MF $100 protein binding GO:0044548 0.00063723 3.19570144 14 286 S 18679 

GO:MF endopeptidase activity GO:0004175 0.00064796 3.18845381 444 286 22 18679 

GO:MF lipid binding GO:0008289 0.00135763 2.86721831 774 286 30 18679 
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СО:МЕ sulfur compound binding GO:1901681 0.00146993 2.83270313 264 286 16 18679 
GO:MF small molecule binding GO:0036094 0.0026774 2.57228599 2516 286 66 18679 
GO:MF molecular function regulator GO:0098772 0.00338698 2.47018793 1115 286 37 18679 
СО:МЕ enzyme regulator activity GO:0030234 0.00478119 2.32046362 911 286 32 18679 
GO:MF heparin binding GO:0008201 0.00489319 2.31040779 168 286 12 18679 
GO:MF disordered domain specific GO:0097718 0.00502854 2.29855818 34 286 6 18679 
GO:MF GT GO:0005488 0.00650883 2.18649739 17070 | 286 279 18679 
GO:MF peptidase activity GO:0008233 0.00988025 2.00523228 647 286 25 18679 
GO:MF phospholipase A2 inhibitor GO:0019834 0.01522927 1.81732102 5 286 3 18679 
GO:MF risit binding GO:0005515 0.017468 1.75775669 14767 | 286 252 18679 
GO:MF protein dimerization activity GO:0046983 0.02077981 1.68235835 1073 286 34 18679 
GO:MF protein homodimerization GO:0042803 0.02176566 1.66222821 679 286 25 18679 
GO:MF тт constituent of skin GO:0030280 0.02820419 154968642 15 286 4 18679 
epidermis 
GO:MF intermediate filament binding GO:0019215 0.02820419 1.54968642 15 286 4 18679 
GO:MF lipase inhibitor activity GO:0055102 0.04800195 1.31874111 17 286 4 18679 
GO:MF MHC class II protein complex | GO:0023026 0.04800195 1.31874111 17: 286 4 18679 
СО:МЕ Ша constituent of GO:0005200 3.56E-19 18.4491073 104 286 24 18679 
cytoskeleton 
GO:MF cell adhesion molecule GO:0050839 6.64E-18 17.1775235 544 286 45 18679 
GO:MF eines binding GO:0003823 1.39E-16 15.858446 163 286 26 18679 
GO:MF cadherin binding GO:0045296 6.00E-15 14.2220133 333 286 33 18679 
GO:MF endopeptidase inhibitor GO:0004866 1.22E-13 12.9120381 175 286 24 18679 
GO:MF ерні inhibitor activity GO:0030414 2.68E-13 12.5712631 181 286 24 18679 
GO:MF peptidase regulator activity GO:0061134 3.43E-13 12.4648782 222 286 26 18679 
GO:MF endopeptidase regulator GO:0061135 5.71Е-13 12.243628 187 286 24 18679 
СО:МЕ EET molecule activity GO:0005198 5.97E-13 12.2240176 701 286 44 18679 
GO:MF enzyme inhibitor activity GO:0004857 2.94E-11 10.5315229 391 286 31 18679 
GO:MF signaling receptor binding GO:0005102 2.14E-10 9.66993416 1551 286 63 18679 
GO:MF immunoglobulin receptor GO:0034987 2.73E-10 9.56398203 17. 286 15 18679 
СО:МЕ т complex GO:0044877 6.19E-10 9.20827271 1304 286 56 18679 
GO:MF em ocn binding GO:0051015 6.46E-09 8.1900157 213 286 21 18679 
GO:MF identical protein binding GO:0042802 3.72Е-08 7.42940097 2063 286 70 18679 
СО:МЕ protease binding GO:0002020 7.52Е-08 7.12404684. 131 286 16 18679 
GO:MF cysteine-type endopeptidase GO:0004869 1.36Е-07 6.86685915 52 286 п 18679 
inhibitor activity 
GO:MF actin binding GO:0003779 1.47Е-07 6.83252306 449 286 28 18679 
СО:МЕ calcium-dependent protein GO:0048306 1.73E-07 6.76310209 83 286 13 18679 
GO:MF гоа protein binding GO:0008092 1.1828Е-06 5.92710429 1001 286 42 18679 
GO:MF enzyme binding GO:0019899 3.2008E-06 5.49474041 2048 286 65 18679 
СО:МЕ serine-type endopeptidase GO:0004867 1.456E-05 4.83682839 98 286 12 18679 
inhibitor activity 
GO:MF serine-type endopeptidase GO:0004252 1.978E-05 4.70377325 167 286 15 18679 
GO:MF anton Liat activity GO:0016209 3.92E-05 4.40670865 87 286 11 18679 
СО:МЕ peroxidase activity GO:0004601 4.3954E-05 4.3570064 53 286 9 18679 
GO:MF glycosaminoglycan binding GO:0005539 7.252Е-05 4.13954296 238 286 17 18679 
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GO:MF serine-type peptidase activity GO:0008236 7.6602E-05 4.11575833 185 286 15 18679 
GO:MF oxidoreductase activity, acting | GO:0016684 8.4577Е-05 4.07274748 57 286 9 18679 
СО:МЕ serine hydrola: GO:0017171 0.00010124 3.99466335 189 286 15 18679 
GO:MF carbohydrate derivative GO:0097367 0.00018875 3.72412397 2282 286 65 18679 
GO:MF Е а binding GO:0001530 0.00019499 3.70999876 32 286 7 18679 
GO:MF chaperone binding GO:0051087 0.00030091 3.52156904 106 286 11 18679 
GO:MF calcium ion binding GO:0005509 0.00033891 3.46991324 723 286 30 18679 
GO:MF protein folding chaperone GO:0044183 0.00045889 3.33828738 36 286 7 18679 
GO:MF unfolded protein binding GO:0051082 0.00057247 3.24224983 113 286 11 18679 
СО:МЕ 5100 protein binding GO:0044548 0.00063723 3.19570144 14 286 5 18679 
СО:МЕ endopeptidase activity GO:0004175 0.00064796 3.18845381 444 286 22 18679 
GO:MF lipid binding GO:0008289 0.00135763 2.86721831 774 286 30 18679 
GO:MF sulfur compound binding GO:1901681 0.00146993 2.83270313 264 286 16 18679 
GO:MF small molecule binding GO:0036094 0.0026774 2.57228599 2516 286 66 18679 
GO:MF molecular function regulator GO:0098772 0.00338698 2.47018793 1115 286 37 18679 
GO:MF enzyme regulator activity GO:0030234 0.00478119 2.32046362 911 286 32 18679 
GO:MF heparin binding GO:0008201 0.00489319 2.31040779 168 286 12 18679 
GO:MF disordered domain specific GO:0097718 0.00502854 2.29855818 34 286 6 18679 
GO:MF Cm GO:0005488 0.00650883 2.18649739 17070 | 286 279 18679 
GO:MF peptidase activity GO:0008233 0.00988025 2.00523228 647 286 25 18679 
GO:MF phospholipase A2 inhibitor GO:0019834 0.01522927 1.81732102 5 286 3 18679 
GO:MF — binding GO:0005515 0.017468 1.75775669 14767 | 286 252 18679 
GO:MF protein dimerization activity GO:0046983 0.02077981 1.68235835 1073 286 34 18679 
GO:MF protein homodimerization GO:0042803 0.02176566 1.66222821 679 286 25 18679 
GO:MF SERES constituent of skin GO:0030280 0.02820419 1.54968642 15 286 4 18679 
GO:MF шие filament binding GO:0019215 0.02820419 1.54968642. 15 286 4 18679 
СО:МЕ lipase inhibitor activity GO:0055102 0.04800195 1.31874111 17 286 4 18679 
СО:МЕ MHC class II protein complex | GO:0023026 0.04800195 1.31874111 17 286 4 18679 
GO:BP um effector process GO:0002252 2.50E-78 77.6024241 1135 288 132 18123 
GO:BP leukocyte mediated immunity GO:0002443 1.56E-77 76.8074492 888 288 120 18123 
GO:BP regulated exocytosis GO:0045055 5.05E-75 74.2966595 790 288 113 18123 
GO:BP exocytosis GO:0006887 8.87E-72 71.0520087 906 288 116 18123 
СО:ВР immune response GO:0006955 2.26E-68 67.6464266 2544 288 170 18123 
GO:BP vesicle-mediated transport GO:0016192 1.89E-66 65.7242567 2193 288 158 18123 
GO:BP neutrophil mediated immunity | GO:0002446 1.44E-65 64.840947 500 288 89 18123 
GO:BP neutrophil degranulation GO:0043312 2.23E-64 63.6522074 483 288 87 18123 
СО:ВР neutrophil activation involved | GO:0002283 5.64E-64 63.2484677 488 288 87 18123 
in immune response 
GO:BP neutrophil activation GO:0042119 7.19E-63 62.1433144 502 288 87 18123 
СО:ВР leukocyte degranulation GO:0043299 1.19E-62 61.9233427 538 288 89 18123 
GO:BP myeloid leukocyte mediated GO:0002444 1.96E-62 61.7074614 558 288 90 18123 
immunity 
GO:BP granulocyte activation GO:0036230 2.49E-62 61.6041998 509 288 87 18123 
GO:BP cell activation GO:0001775 1.08Е-60 59.9681373 1507 288 130 18123 
GO:BP myeloid cell activation GO:0002275 2.06E-60 59.686503 552 288 88 18123 
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GO:BP immune system process GO:0002376 1.53E-59 58.8156547 3438 288 183 18123 
GO:BP secretion by cell GO:0032940 2.45Е-57 56.6101168 1351 288 121 18123 
СО:ВР | myeloid leukocyte activation | GO:0002274 | 2.49Е-56 55.6037848 671 288 91 18123 
GO:BP leukocyte activation involved GO:0002366 1.21Е-55 54.9159717 723 288 93 18123 
іп immune response 
GO:BP cell activation involved in GO:0002263 2.01E-55 54.6970091 727 288 93 18123 
immune response 
GO:BP export from cell GO:0140352 3.88E-55 54.4107173 1413 288 121 18123 
GO:BP secretion GO:0046903 2.17E-54 53.6632118 1492 288 123 18123 
GO:BP leukocyte activation GO:0045321 5.97E-50 49.224289 1341 288 113 18123 
GO:BP humoral immune response GO:0006959 8.06Е-45 44.0934814 383 288 65 18123 
СО:ВР transport GO:0006810 1.18Е-37 36.9299465 5214 288 192 18123 
GO:BP establishment of localization GO:0051234 3.98E-37 36.4006131 5364 288 194 18123 
GO:BP establishment of localization GO:0051649 1.71E-32 31.7672174 2778 288 133 18123 
in cell 
GO:BP antimicrobial humoral GO:0019730 1.95E-32 31.7096374 144 288 38 18123 
response 
GO:BP | defense response GO:0006952 | 2.39Е-31 30.6214787 1956 |288 110 18123 
СО:ВР biological process involvedin | GO:0044419 2.16E-29 28.6647252 1768 288 102 18123 
interspecies interaction 
between organisms 
GO:BP | response to external stimulus | GO:0009605 | 3.74E-29 28.4275491 3114 |288 136 18123 
GO:BP Tesponse to stress GO:0006950 1.50Е-28 27.8227604 4321 288 161 18123 
GO:BP response to other organism GO:0051707 2.10E-26 25.6773767 1600 288 93 18123 
GO:BP response to external biotic GO:0043207 2.21E-26 25.6564687 1601 288 93 18123 
stimulus 
GO:BP response to biotic stimulus GO:0009607 2.62Е-26 25.5818974 1639 288 94 18123 
GO:BP localization GO:0051179 4.11E-26 25.3866372 6938 288 204 18123 
GO:BP cellular localization GO:0051641 5.81E-26 25.2357235 3519 288 140 18123 
GO:BP | phagocytosis GO:0006909 | 1.52Е-25 24.8191105 388 288 48 18123 
СО:ВР defense response to other GO:0098542 2.48Е-25 24.605158 1222 288 80 18123 
GO:BP | defense response to bacterium | GO:0042742 | 1.58E-24 23.8012777 349 288 45 18123 
GO:BP peptide cross-linking GO:0018149 2.57Е-20 19.5899207 96 288 25 18123 
GO:BP complement activation GO:0006956 4.71E-20 19.3269671 178 288 31 18123 
GO:BP response to stimulus GO:0050896 8.55Е-20 19.0680445 9487 288 231 18123 
GO:BP response to bacterium GO:0009617 1.37E-18 17.8630065 743 288 55 18123 
GO:BP | endocytosis GO:0006897 | 3.08E-I8 17.5111988 700 288 53 18123 
GO:BP positive regulation ofimmune | GO:0050778 3.31E-18 17.4805084 757 288 55 18123 
respon 
GO:BP positive regulation of immune | GO:0002684 9.20E-18 17.0362858 1079 288 65 18123 
system process 
GO:BP | activation of immune response | GO:0002253 | 1.86E-17 16.7300364 566 288 47 18123 
GO:BP innate immune response GO:0045087 2.47Е-17 16.6068267 1003 288 62 18123 
GO:BP humoral immune response GO:0002455 2.59E-17 16.5863754 153 288 27 18123 
mediated by circulating 
immunoglobulin 
GO:BP complement activation, GO:0006958 3.57E-17 16.447622 140 288 26 18123 
classical pathway 
СО:ВР Fc-gamma receptor signaling GO:0038096 2.15E-16 15.6667281 135 288 25 18123 
pathway involved in 
phagocytosis 
СО:ВР immune response-regulating GO:0002433 2.15E-16 15.6667281 135 288 25 18123 
cell surface receptor signaling 
pathway involved in 
phagocytosis 
GO:BP receptor-mediated endocytosis | GO:0006898 2.31E-16 15.6367546 335 288 36 18123 
GO:BP Fc-gamma receptor signaling GO:0038094 3.14E-16 15.5032741 137 288 25 18123 
pathway 
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GO:BP regulation of immune system GO:0002682 3.68E-16 15.4346711 1734 288 81 18123 
process 

GO:BP Fc receptor mediated GO:0002431 6.54E-16 15.1847034 141 288 25 18123 
stimulatory signaling pathway 

GO:BP platelet degranulation GO:0002576 7.30E-16 15.1368364 127 288 24 18123 

GO:BP regulation of immune GO:0050776 1.75Е-15 14.7569082. 1159 288 64 18123 
response 

GO:BP phagocytosis, recognition GO:0006910 2.27Е-15 14.6440186 133 288 24 18123 

GO:BP lymphocyte mediated GO:0002449 2.51Е-15 14.6007503 360 288 36 18123 
immunity 

GO:BP cornification GO:0070268 9.78E-15 14.0094718 112 288 22 18123 

СО:ВР immune response-activating GO:0002429 1.21Е-14 13.9186618 475 288 40 18123 
cell surface receptor signaling 
pathway 

GO:BP immune response-activating GO:0002757 1.21Е-14 13.9186618 475 288 40 18123 

duction 

СО:ВР ponse-regulating GO:0002768 2.35E-14 13.6292597 510 288 4l 18123 
cell surface receptor signaling 
pathway 

GO:BP immune response-regulating GO:0002764 2.89E-14 13.5383815 513 288 4l 18123 
signaling pathway 

СО:ВР retina homeostasis GO:0001895 3.44E-14 13.4636898 79 288 19 18123 

GO:BP immunoglobulin mediated GO:0016064 4.56E-14 13.3413924 221 288 28 18123 
immune response 

GO:BP B cell mediated immunity GO:0019724 6.53E-14 13.1849054 224 288 28 18123 

СО:ВР negative regulation of GO:0010466 7.45Е-14 13.1280982 265 288 30 18123 
peptidase activity 

СО:ВР regulation of complement GO:0030449 1.07E-13 12.9708018 110 288 21 18123 
activation 

GO:BP antibacterial humoral response | GO:0019731 1.17E-13 12.9304088 61 288 17 18123 

GO:BP leukocyte migration GO:0050900 1.54E-13 12.8114101 511 288 40 18123 

GO:BP negative regulation of GO:0010951 1.66E-13 12.7807013 252 288 29 18123 
endopeptidase activity 

GO:BP regulation of response to GO:0032101 1.94E-13 12.7129891 1238 288 63 18123 
external stimulus 

GO:BP cell killing GO:0001906 2.14E-13 12.6692194 196 288 26 18123 

СО:ВР regulation of peptidase GO:0052547 2.77E-13 12.557545 467 288 38 18123 
activity 

GO:BP Fc receptor signaling pathway | GO:0038093 3.26E-13 12.4863695 238 288 28 18123 

СО:ВР regulation of response to GO:0080134 3.44Е-13 12.4635056 1626 288 73 18123 
str 

GO:BP positive regulation of response | GO:0048584 3.77E-13 12.4240307 2368 288 91 18123 
to stimulus 

GO:BP regulation of humoral immune | GO:0002920 5.44E-13 12.2645441 134 288 22 18123 
response 

GO:BP negative regulation of GO:0045861 7.42Е-13 12.129755 356 288 33 18123 
proteolysis 

GO:BP cell recognition GO:0008037 118Е-12 11.9282087 271 288 29 18123 

GO:BP adaptive immune response GO:0002460 1.68Е-12 11.7746189 366 288 33 18123 
based оп somatic 
recombination of immune 
receptors built from 
immunoglobulin superfamily 
domains 

GO:BP programmed cell death GO:0012501 4.28E-12 11.3688604 2158 288 84 18123 

СО:ВР negative regulation of GO:0032102 7.28Е-12 11.1376621 601 288 41 18123 
response to external stimulus 

GO:BP regulation of endopeptidase GO:0052548 7.98E-12 11.098218 437 288 35 18123 
activity 

GO:BP killing of cells of other GO:0031640 8.54E-12 11.0685313 7] 288 17 18123 
organism 

GO:BP regulation of proteolysis GO:0030162 9.00E-12 11.0459398 756 288 46 18123 

GO:BP cell death GO:0008219 2.16E-11 10.6648525 2306 288 86 18123 

GO:BP membrane organization GO:0061024 3.21E-11 10.4935133 1016 288 53 18123 

GO:BP locomotion GO:0040011 5.25E-11 10.2800628 2039 288 79 18123 

СО:ВР regulation of biological GO:0065008 5.97E-11 10.2237029 4116 288 123 18123 


quality 
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GO:BP cell motility GO:0048870 6.91E-11 10.1603288 1840 288 74 18123 

GO:BP localization of cell GO:0051674 6.91E-11 10.1603288 1840 288 74 18123 

GO:BP proteolysis GO:0006508 9.40E-11 10.0267583 1851 288 74 18123 

GO:BP hemostasis GO:0007599 2.12E-10 9.67405228 355 288 30 18123 

GO:BP biological proci involved іп | GO:0044403 2.24Е-10 9.64989467 331 288 29 18123 
symbiotic interaction 

СО:ВР movement of cell ог GO:0006928 2.63E-10 9.58000022 2319 288 84 18123 
subcellular component 

GO:BP opsonization GO:0008228 4.75Е-10 9.32345989 69 288 15 18123 

GO:BP keratinization GO:0031424 4.76Е-10 9.32273516 225 288 24 18123 

GO:BP | adaptive immune response GO:0002250 | 5.37E-10 9.26986814 715 288 42 18123 

GO:BP regulation of immune effector | GO:0002697 5.62E-10 9.25012504 421 288 32 18123 

GO:BP l process involved іп | GO:0051702 7.02E-10 9.15337235 99 288 17 18123 
interaction with symbiont 

GO:BP negative regulation of GO:0051346 9.13E-10 9.03960435 456 288 33 18123 
hydrolase activity. 

GO:BP blood coagulation GO:0007596 9.97E-10 9.00121333 351 288 29 18123 

GO:BP supramolecular fiber GO:0097435 1.10Е-09 8.96026071 863 288 46 18123 
organization 

GO:BP cell surface receptor signaling | GO:0007166 1.25E-09 8.90392992 3226 288 102 18123 
pathway 

GO:BP keratinocyte differentiation GO:0030216 1.41E-09 8.84929668 306 288 27 18123 

GO:BP coagulation GO:0050817 1.64E-09 8.78497833 358 288 29 18123 

GO:BP wound healing GO:0042060 1.87E-09 8.72731318 555 288 36 18123 

GO:BP response to wounding GO:0009611 2.05E-09 8.68879389 680 288 40 18123 

GO:BP phagocytosis, engulfment GO:0006911 3.20E-09 8.49482689 125 288 18 18123 

СО:ВР defense response to fungus GO:0050832 3.73E-09 8.42820974 42 288 12 18123 

GO:BP tissue homeostasis GO:0001894 4.29E-09 8.36721603 272 288 25 18123 

GO:BP cell migration GO:0016477 4.34E-09 8.36256424 1660 288 66 18123 

GO:BP antimicrobial humoral GO:0061844 5.75E-09 8.24026335 81 288 15 18123 
immune response mediated by 
antimicrobial peptide 

GO:BP negative regulation of blood GO:0030195 6.29E-09 8.20161699 55 288 13 18123 
coagulation 

GO:BP negative regulation of GO:1900047 8.07E-09 8.09305825 56 288 13 18123 
hemos 

СО:ВР fibrinolysis GO:0042730 9.33E-09 8.03032445 26 288 10 18123 

GO:BP regulation of blood GO:0030193 1.04E-08 7.9826602 70 288 14 18123 
coagulation 

GO:BP plasma membrane GO:0099024 1.08E-08 7.96770377 134 288 18 18123 
invagination 

GO:BP regulation of hemostasis GO:1900046 1.56E-08 7.80571485 72 288 14 18123 

GO:BP homotypic cell-cell adhesion GO:0034109 1.71E-08 7.7678501 87 288 15 18123 

GO:BP positive regulation of GO:0048518 2.15E-08 7.66680788 6474 288 159 18123 
biological process 

GO:BP organonitrogen compound GO:1901564 2.58E-08 7.58776195 6904 288 166 18123 
metabolic process 

GO:BP negative regulation of GO:0050819 2.61E-08 7.58298592 61 288 13 18123 
coagulation 

GO:BP anatomical structure GO:0060249 2.63E-08 7.57940677 486 288 32 18123 
homeosta 

GO:BP membrane invagination GO:0010324 2.93E-08 7.53333831 142 288 18 18123 

GO:BP regulation of body fluid levels | GO:0050878 3.13E-08 7.50472809 519 288 33 18123 

GO:BP killing by host of symbiont GO:0051873 3.37E-08 7.47216481 29 288 10 18123 
cells 

GO:BP regulation of coagulation GO:0050818 4.10E-08 7.38771455 77 288 14 18123 

GO:BP regulation of defense response | GO:0031347 4.49Е-08 7.34799289 889 288 44 18123 

GO:BP inflammatory response GO:0006954 4.72Е-08 7.3259117 962 288 46 18123 
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СО:ВР platelet aggregation GO:0070527 5.01E-08 7.30028873 64 288 13 18123 

GO:BP protein metabolic process GO:0019538 6.41E-08 7.19286818 5912 288 148 18123 

GO:BP killing of cells in other GO:0051883 7.26E-08 7.13911902 31 288 10 18123 
organism involved in 
symbiotic interaction 

GO:BP | epidermal cell differentiation | GO:0009913 | 1.19E-07 6.92453132 370 288 27 18123 

GO:BP Tesponse to fungus GO:0009620 1.55E-07 6.81097891 56 288 12 18123 

GO:BP platelet activation GO:0030168 3.02E-07 6.51970596 163 288 18 18123 

GO:BP regulation of response to GO:0048583 3.24E-07 6.48936053 4416 288 119 18123 
stimulus 

GO:BP negative regulation of GO:0043086 3.79E-07 6.42150824 841 288 4l 18123 
catalytic activity 

GO:BP negative regulation of immune | GO:0002683 6.20E-07 6.20729825 614 288 34 18123 
system process 

GO:BP multicellular organismal GO:0032501 1.1311E-06 5.94650686 8000 288 179 18123 
proce: 

СО:ВР negative regulation of defense | С0:003 1348 1.2147E-06 5.91552871 440 288 28 18123 
response 

GO:BP regulation of inflammatory GO:0050727 1.2456E-06 5.90461926 565 288 32 18123 
response 

GO:BP negative regulation of wound GO:0061045 1.5404E-06 5.81235425 83 288 13 18123 
healing 

GO:BP response to chemical GO:0042221 2.5387E-06 5.59539256 4829 288 124 18123 

СО:ВР skin development GO:0043588 2.7553E-06 5.55983692 426 288 27 18123 

GO:BP negative regulation of GO:0048585 3.5193E-06 5.45354175 1938 288 66 18123 
response to stimulus. 

GO:BP negative regulation of immune | GO:0050777 3.7643E-06 5.42431642 373 288 25 18123 
response 

GO:BP cytoskeleton organization GO:0007010 4.678E-06 5.32993969 1473 288 55 18123 

СО:ВР cellular response to cytokine GO:0071345 4.9139E-06 5.30857303 1148 288 47 18123 
stimulus 

GO:BP glucose catabolic process to GO:0061718 5.443E-06 5.26415965 24 288 8 18123 
pyruvate 

GO:BP NADH regeneration а О:0006735 5.443Е-06 5.26415965 24 288 8 18123 

GO:BP canonical glycolysis GO:0061621 5.443E-06 5.26415965 24 288 8 18123 

GO:BP regulation of response to GO:1903034 5.6385E-06 5.24883869 172 288 17 18123 
wounding 

GO:BP regulation of localization GO:0032879 5.9658E-06 5.22433385 2906 288 86 18123 

СО:ВР epidermis development GO:0008544 7.7423Е-06 5.1111281 478 288 28 18123 

GO:BP negative regulation of GO:0050728 8.5418E-06 5.06844923 359 288 24 18123 
inflammatory response 

СО:ВР inflammatory response to GO:0002437 9.4411E-06 5.02497775 251 288 20 18123 
antigenic stimulus 

GO:BP biological adhesion GO:0022610 1.1053E-05 4.95650677 1509 288 55 18123 

СО:ВР glycolytic process through GO:0061620 1.1247E-05 4.9489588 26 288 8 18123 
glucose-6-phosphate 

GO:BP cellular protein metabolic GO:0044267 1.213E-05 4.91615488 5280 288 130 18123 
process 

СО:ВР теѕропѕе to oxidative stress GO:0006979 1.2536E-05 4.90185358 457 288 27 18123 

GO:BP protein folding GO:0006457 1.2939E-05 4.88810859 230 288 19 18123 

GO:BP regulation of protein GO:0051246 1.3662E-05 486449507 2759 288 82 18123 
metabolic process 

GO:BP regulation of wound healing GO:0061041 1.3673E-05 4.86415129 138 288 15 18123 

GO:BP negative regulation of GO:0048519 1.3965E-05 4.85495218 5861 288 140 18123 
biological process 

GO:BP negative regulation of GO:1903035 1.4361E-05 4.84281669 99 288 13 18123 
response to wounding 

СО:ВР glycolytic рго through GO:0061615 1.5764E-05 4.80234299 27 288 8 18123 
fructose-6-phosphate 

СО:ВР тевропве to cytokine GO:0034097 2.064E-05 4.68529864 1242 288 48 18123 

GO:BP blood coagulation, fibrin clot GO:0072378 2.1767E-05 4.66220849 28 288 8 18123 
formation 

СО:ВР protein activation cascade GO:0072376 2.1767E-05 4.66220849 28 288 8 18123 

GO:BP cellular component GO:0016043 2.1888E-05 4.6598009 6661 288 153 18123 


organization 
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GO:BP regulation of multicellular GO:0051239 2.2835E-05 4.64139565 2888 288 84 18123 
organismal process 

GO:BP negative regulation of protein GO:0051248 3.0219E-05 4.51972279 1216 288 47 18123 
metabolic process 

GO:BP defense response to Gram- GO:0050829 3.6158E-05 4.44179494 88 288 12 18123 
negative bacterium 

GO:BP regulation of inflammatory GO:0002861 3.8421E-05 4.41543145 220 288 18 18123 
response to antigenic stimulus 

GO:BP response to inorganic GO:0010035 4.0687E-05 4.39054214 583 288 30 18123 
substance 

GO:BP positive regulation of cellular GO:0051130 4.3066E-05 4.36586201 1189 288 46 18123 
component organization 

GO:BP regulation of cellular protein GO:0032268 4.8778E-05 4.3117751 2591 288 ЧҮ. 18123 
metabolic process 

GO:BP epithelial cell differentiation GO:0030855 5.6427E-05 4.24851395 807 288 36 18123 

GO:BP regulation of cell activation GO:0050865 5.7599E-05 4.23958379 662 288 32 18123 

GO:BP response to organic substance GO:0010033 6.1088E-05 4.21404553 3457 288 94 18123 

GO:BP actin filament-based process GO:0030029 6.2008E-05 4.2075508 810 288 36 18123 

GO:BP cell adhesion GO:0007155 6.657E-05 4.17671946 1502 288 58 18123 

GO:BP multicellular organismal GO:0048871 8.1511E-05 4.08878377 533 288 28 18123 
homeostasis 

GO:BP B cell receptor signaling GO:0050853 8.1582Е-05 4.08840479 135 288 14 18123 
pathway 

GO:BP negative regulation of GO:0002862 8.5682E-05 4.06710831 206 288 17 18123 
inflammatory response to 
antigenic stimulus 

GO:BP cell-cell adhesion GO:0098609 8.9323E-05 4.04903442 898 288 38 18123 

GO:BP glucose catabolic process GO:0006007 9.0753E-05 4.04214029 33 288 8 18123 

GO:BP negative regulation of cellular | GO:0032269 0.00010147 3.99365808 1141 288 44 18123 
protein metabolic process 

GO:BP cellular response to chemical GO:0070887 0.00010521 3.97796099 3443 288 93 18123 
stimulus 

GO:BP negative regulation of GO:0044092 0.00011364 3.94447939 1228 288 46 18123 
molecular function 

GO:BP regulation of catalytic activity | GO:0050790 0.00012203 3.9135443 2452 288 73 18123 

GO:BP respiratory burst GO:0045730 0.00014966 3.82490698 35 288 8 18123 

GO:BP pentose biosynthetic process GO:0019322 0.00015619 3.80635128 4 288 4 18123 

GO:BP plasminogen activation GO:0031639 0.00016335 3.7868718 24 288 7 18123 

GO:BP intermediate filament GO:0045109 0.00016335 3.7868718 24 288 7 18123 
organization 

GO:BP NAD metabolic process GO:0019674 0.00016792 3.77490567 49 288 9 18123 

GO:BP Fc-epsilon receptor signaling GO:0038095 0.00018229 3.7392477 167 288 15 18123 
pathway 

GO:BP amyloid fibril formation GO:1990000 0.0001952 3.70952689 102 288 12 18123 

GO:BP regulation of hydrolase GO:0051336 0.00020144 3.69584372 1252 288 46 18123 
activity 

GO:BP intermediate filament GO:0045104 0.00020195 3.69475786 50 288 9 18123 
cytoskeleton organization 

GO:BP signal transduction GO:0007165 0.00021291 3.67179963 6447 288 146 18123 

GO:BP protein stabilization GO:0050821 0.00023647 3.62621927 195 288 16 18123 

GO:BP intermediate filament-based GO:0045103 0.00024184 3.61647467 51 288 9 18123 
process 

GO:BP cellular component GO:0071840 0.00025093 3.60044527 6880 288 153 18123 
organization or biogenesis 

GO:BP modulation of process of other | GO:0035821 0.00028488 3.5453316 149 288 14 18123 
organism 

GO:BP glucose 6-phosphate GO:0051156 0.00030212 3.51981757 26 288 7 18123 
metabolic process 

GO:BP actin cytoskeleton GO:0030036 0.00030806 3.51136134 712 288 32 18123 
organization 

GO:BP positive regulation of B cell GO:0050871 0.00033663 3.47284151 151 288 14 18123 
activation 

GO:BP mucosal immune response GO:0002385 0.00037022 3.43153671 39 288 8 18123 

GO:BP regulation of biological GO:0050789 0.00037878 3.42161678 12122 | 288 232 18123 
process 
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GO:BP detection of chemical stimulus | GO:0001580 0.00045645 3.34061041 40 288 8 18123 
involved in sensory perception 
of bitter taste. 

GO:BP actin nucleation GO:0045010 0.00047811 3.32047475 55 288 9 18123 

GO:BP sitive regulation of gene GO:0010628 0.00049025 3.30958618 1206 288 44 18123 
expression 

GO:BP actin filament organization GO:0007015 0.00050761 3.29446592 445 288 24 18123 

СО:ВР innate immune response іп GO:0002227 0.00052921 3.27637378 28 288 7 18123 
mucosa 

GO:BP negative GO:0051918 0.00057783 3.23819827 10 288 5 18123 
fibrinolysis 

GO:BP NADPH regeneration GO:0006740 0.00060433 3.21872742 18 288 6 18123 

GO:BP zymogen activation GO:0031638 0.00076803 3.11462035 58 288 9 18123 

GO:BP NADH metabolic process GO:0006734 0.00082582 3.08311583 43 288 8 18123 

GO:BP organ or tissue specific GO:0002251 0.00082582 3.08311583 43 288 8 18123 
immune response 

СО:ВР regulation of protein stability GO:0031647 0.00098731 3.00554685 302 288 19 18123 

GO:BP sensory perception of bitter GO:0050913 0.00099553 3.001946 44 288 8 18123 
taste 

GO:BP regulation of apoptotic GO:2001233 0.00105445 2.97697483 365 288 21 18123 
signaling pathway 

GO:BP positive regulation of GO:0051050 0.00111565 2.95247202 951 288 37 18123 
transport 

GO:BP hydrogen peroxide catabolic GO:0042744 0.00112826 2.94758955 31 288 7 18123 
process 

GO:BP cellular oxidant detoxification | GO:0098869 0.00123124 2.90965828 99 288 11 18123 

GO:BP regulation of programmed cell | GO:0043067 0.00131897 2.87976543 1603 288 52 18123 
death 

СО:ВР detection of chemical stimulus | GO:0050912 0.00142587 2.84591914 46 288 8 18123 
involved in sensory perception 
of taste 

GO:BP regulation of transport GO:0051049 0.00148793 2.82741805 1839 288 57 18123 

GO:BP regulation of B cell activation GO:0050864 0.00156999 2.80410279 197 288 15 18123 

СО:ВР regulation of cell GO:0022604 0.00162028 2.79041049 312 288 19 18123 
morphogen 

СО:ВР regulation of anatomical GO:0022603 0.00162118 2.79016965 1048 288 39 18123 
structure morphogenesis 

СО:ВР homeostatic process GO:0042592 0.00163013 2.78777741 1985 288 60 18123 

GO:BP regulation of leukocyte GO:0002694 0.00164373 2.78417027 617 288 28 18123 
activation 

GO:BP regulation of cytokine GO:0001817 0.00169109 2.77183405 847 288 34 18123 
production 

GO:BP interleukin-12-mediated GO:0035722 0.00169487 2.77086368 47 288 8 18123 
signaling pathway 

GO:BP cellular response to stimulus GO:0051716 0.00185323 2.73207082 7873 288 166 18123 

СО:ВР extracellular matrix GO:0022617 0.00192208 2.71622794 83 288 10 18123 
disassembly 

GO:BP biological regulation GO:0065007 0.00196686 2.70622669 12802 | 288 239 18123 

GO:BP regulation of molecular GO:0065009 0.00201045 2.6967061 3191 288 84 18123 
function 

СО:ВР cytokine production GO:0001816 0.00203737 2.69093083 854 288 34 18123 

СО:ВР regulation of cellular GO:0051128 0.00221247 2.65512338 2439 288 69 18123 
component organization 

СО:ВР positive regulation of response | СО:0032103 0.00222278 2.65310297 518 288 25 18123 
to external stimulus 

GO:BP regulation of cellular process GO:0050794 0.00223691 2.65035127 11517 | 288 221 18123 

GO:BP induction of bacterial GO:0043152 0.00228445 2.64121774 6 288 4 18123 
agglutination 

СО:ВР cellular response to GO:0071349 0.0023647 2.62622324 49 288 8 18123 
interleukin-12 

СО:ВР positive regulation of GO:0010638 0.00248161 2.60526657 630 288 28 18123 
organelle organization 

СО:ВР regulation of lymphocyte GO:0051249 0.00264694 2.57725581 523 288 25 18123 
activation 

СО:ВР sensory perception of taste GO:0050909 0.00272159 2.5651775 67 288 9 18123 

СО:ВР response to interleukin-12 GO:0070671 0.00277666 2.55647774 50 288 8 18123 
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GO:BP reactive oxygen species GO:0072593 0.00279685 2.55333142 234 288 16 18123 
metabolic process 

GO:BP regulation of cell death GO:0010941 0.00303181 2.51829753 1740 288 54 18123 

GO:BP chaperone-mediated protein GO:0051131 0.00307345 2.51237392 23 288 6 18123 
complex assembly 

GO:BP protein refolding GO:0042026 0.00307345 2.51237392 23 288 6 18123 

GO:BP regulation of fibrinolysis GO:0051917 0.00435667 2.36084566 14 288 5 18123 

GO:BP cellular detoxification GO:1990748 0.00466545 2.33110664 113 288 п 18123 

GO:BP positive regulation of GO:1902905 0.00472997 2.32514118 215 288 15 18123 
supramolecular fiber 
organization 

GO:BP hexose catabolic process GO:0019320 0.00509309 2.29301827 54 288 8 18123 

GO:BP regulation of vesicle-mediated | GO:0060627 0.00537217 2.26985005 544 288 25 18123 
transport 

GO:BP Arp2/3 complex-mediated GO:0034314 0.0059192 222773718 39 288 7 18123 
actin nucleation 

GO:BP cell communication GO:0007154 0.00634816 2.19735247 6961 288 149 18123 

GO:BP positive regulation of response | GO:1903036 0.00638403 2.19490477 74 288 9 18123 
to wounding 

GO:BP Iymphocyte activation GO:0046649 0.00669968 2.17394584 780 288 31 18123 

GO:BP hydrogen peroxide metabolic GO:0042743 0.00676476 2.16974749 56 288 8 18123 
process 

GO:BP cellular response to organic GO:0071310 0.00693718 2.15881718 2821 288 75 18123 
substance 

GO:BP response to metal ion GO:0010038 0.00697248 2.15661258 277 288 20 18123 

GO:BP positive regulation of protein GO:0032273 0.00715303 2.14550971 142 288 12 18123 
polymerization. 

GO:BP regulation of cell-substrate GO:0010810 0.00743906 2.12848194 223 288 15 18123 
adhesion 

GO:BP tissue migration GO:0090130 0.00786055 2.10454724 380 288 20 18123 

GO:BP response to reactive oxygen GO:0000302 0.00786065 2.1045418 224 288 15 18123 
spec 

СО:ВР positive regulation of GO:0002696 0.00792838 2.10081542 414 288 21 18123 
leukocyte activation 

GO:BP negative regulation of GO:0043069 0.00844185 2.07356259 952 288 35 18123 
programmed cell death 

GO:BP e regulation of cellular GO:0048522 0.00851111 2.07001364 5863 288 130 18123 
process 

GO:BP signaling GO:0023052 0.00891812 2.04972647 6937 288 148 18123 

GO:BP cellular response to toxic GO:0097237 0.00915809 2.03819504 121 288 п 18123 
substance 

GO:BP regulation of respiratory burst GO:0060263 0.00926038 2.03337097 16 288 5 18123 

GO:BP pentose-phosphate shunt GO:0006098 0.00926038 2.03337097 16 288 5 18123 

GO:BP apoptotic process GO:0006915 0.00948456 2.02298278 1998 288 58 18123 

GO:BP monosaccharide biosynthetic GO:0046364 0.00984647 2.00671945 99 288 10 18123 
proc 

GO:BP positive regulation of GO:1900026 0.00996774 2.00140336 42 288 7 18123 
substrate adhesion-dependent 
cell spreading 

GO:BP negative regulation of GO:0051241 0.01026828 1.9885025 1174 288 40 18123 
multicellular organismal 
process 

GO:BP external encapsulating GO:0045229 0.01067889 1.97147394 422 288 21 18123 
structure organization 

GO:BP monosaccharide catabolic GO:0046365 0.01153315 1.93805206 60 288 8 18123 
process 

GO:BP cell redox homeostasis GO:0045454 0.01174533 1.93013462 43 288 7 18123 

GO:BP positive regulation of cell GO:0050867 0.01235657 1.90810218 426 288 21 18123 
activation 

GO:BP positive regulation of cell- GO:0010811 0.01257533 1.90048077 125 288 п 18123 
substrate adhesion 

GO:BP negative regulation of cellular. | GO:0048523 0.01268534 1.89669779 5153 288 117 18123 
process 

GO:BP negative regulation of GO:0043066 0.01270119 1.89615561 928 288 34 18123 
apoptotic process 

GO:BP apoptotic signaling pathway GO:0097190 0.01274108 1.89479359 609 288 26 18123 
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GO:BP anatomical s GO:0009653 0.01274496 1.89466146 2816 288 74 18123 
morphogenesis 

GO:BP positive regulation of actin GO:0030838 0.01291088 1.88904409 102 288 10 18123 
filament polymerization 

GO:BP cell junction organization GO:0034330 0.013376 1.87367362 727 288 29 18123 

GO:BP positive regulation of protein- GO:0031334 0.01402912 1.85296945 265 288 16 18123 
containing complex assembly. 

СО:ВР post-translational protein GO:0043687 0.01448382 1.83911678 362 288 19 18123 
modification 

GO:BP anatomical structure GO:0048856 0.01601036 1.79559892 6164 288 134 18123 
development 

GO:BP regulation of apoptotic GO:0042981 0.01672779 1.77656138 1560 288 48 18123 
proce: 

GO:BP positive regulation of GO:0051251 0.01694871 1.77086323 366 288 19 18123 
lymphocyte activation 

СО:ВР | angiogenesis GO:0001525 | 0.01802467 | 174413274 621 288 26 18123 

GO:BP epithelium development GO:0060429 0.01880272 1.72577925 1338 288 43 18123 

GO:BP tissue development GO:0009888 0.01915473 1.71772385 2093 288 59 18123 

GO:BP regulation of actin filament- GO:0032970 0.01959768 1.70779538 404 288 20 18123 
based process 

GO:BP cytokine-mediated signaling GO:0019221 0.02029365 1.69263978 824 288 31 18123 
pathway 

СО:ВР epithelial cell migration GO:0010631 0.02055889 1.68700029 371 288 19 18123 

СО:ВР protein polymerization GO:0051258 0.02254642 1.6469224 307 288 17 18123 

GO:BP epithelium migration GO:0090132 0.02304407 1.63744085 374 288 19 18123 

GO:BP protein-containing complex GO:0065003 0.02465889 1.60802656 1724 288 51 18123 
assembly 

СО:ВР cell chemotaxis GO:0060326 0.02560235 1.59172022 310 288 17 18123 

GO:BP regulation of establishment of | GO:0070201 0.02561607 1.59148748 595 288 25 18123 
protein localization 

СО:ВР ephrin receptor signaling GO:0048013 0.02718343 1.56569571 88 288 9: 18123 
pathway 

СО:ВР regulation of cell adhesion GO:0030155 0.02813063 1.55082056 756 288 29 18123 

GO:BP response to cadmium ion GO:0046686 0.02985421 1.52499448 68 288 8 18123 

GO:BP establishment or maintenance GO:0007163 0.03001834 1.52261329 220 288 14 18123 
of cell polarity 

GO:BP myeloid leukocyte migration GO:0097529 0.03001834 1.52261329 220 288 14 18123 

GO:BP regulation of actin GO:0008064 0.03010199 1.52140479 191 288 13 18123 
polymerization or 
depolymerization 

СО:ВР neutrophil-mediated killing of | GO:0070943 0.03041074 1.51697298 10 288 4 18123 
symbiont cell 

GO:BP actin polymerization or GO:0008154 0.0315984 1.50033491 221 288 14 18123 
depolymerization 

GO:BP regulation of actin filament GO:0030832 0.0318358 1.49708422 192 288 13 18123 
length 

СО:ВР leukocyte mediated GO:0001909 0.03228312 1.49102446 113 288 10 18123 
cytotoxicity 

СО:ВР extracellular matrix GO:0030198 0.03338679 1.47642539 419 288 20 18123 
organization. 

СО:ВР extracellular structure а О:0043062. 0.03456012. 1.46142479 420 288 20 18123 
organization 

GO:BP Tesponse to toxic substance GO:0009636 0.03585527 1.44544698 254 288 15 18123 

GO:BP regulation of protein- GO:0043254 0.03713534 1.43021263 458 288 21 18123 
containing complex assembly 

GO:BP acute inflammatory response GO:0002526 0.03769252 1.42374485 115 288 10 18123 

СО:ВР regulation of intrinsic GO:2001242 0.03790212 1.42133646 167 288 12 18123 
apoptotic signaling pathway 

GO:BP system development GO:0048731 0.03847193 1.41485605 5095 288 114 18123 

СО:ВР ameboidal-type cell migration | GO:0001667 0.03878637 1.41132081 496 288 22 18123 

GO:BP response to ethanol GO:0045471 0.03985333 1.39953535 141 288 11 18123 

СО:ВР antigen receptor-mediated GO:0050851 0.0402315 1.39543375 321 288 17 18123 
signaling pathway 

GO:BP detoxification GO:0098754 0.0455075 1.34191703 143 288 11 18123 
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СО:ВР positive regulation of GO:0048260 0.04905683 1.30930056 53 288 7 18123 
receptor-mediated endocytosis 
GO:BP NADP metabolic process GO:0006739 0.04985034 1.3023319 36 288 6 18123 
GO:BP immune effector process GO:0002252 2.50E-78 77.6024241 1135 288 132 18123 
GO:BP leukocyte mediated immunity GO:0002443 1.56E-77 76.8074492 888 288 120 18123 
GO:BP regulated exocytosis GO:0045055 5.05E-75 74.2966595 790 288 113 18123 
GO:BP exocytosis GO:0006887 8.87E-72 71.0520087 906 288 116 18123 
GO:BP immune response GO:0006955 2.26E-68 67.6464266 2544 288 170 18123 
GO:BP vesicle-mediated transport GO:0016192 1.89E-66 65.7242567 2193 288 158 18123 
GO:BP neutrophil mediated immunity | GO:0002446 1.44E-65 64.840947 500 288 89 18123 
GO:BP neutrophil degranulation GO:0043312 2.23E-64 63.6522074 483 288 87 18123 
GO:BP neutrophil activation involved | GO:0002283 5.64E-64 63.2484677 488 288 87 18123 
in immune response 
GO:BP neutrophil activation GO:0042119 7.19E-63 62.1433144 502 288 87 18123 
GO:BP leukocyte degranulation GO:0043299 1.19E-62 61.9233427 538 288 89 18123 
GO:BP myeloid leukocyte mediated GO:0002444 1.96E-62 61.7074614 558 288 90 18123 
immunity 
GO:BP granulocyte activation GO:0036230 2.49E-62 61.6041998 509 288 87 18123 
GO:BP cell activation GO:0001775 1.08E-60 59.9681373 1507 288 130 18123 
GO:BP myeloid cell activation GO:0002275 2.06E-60 59.686503 552 288 88 18123 
involved in immune response 
GO:BP immune system process GO:0002376 1.53Е-59 58.8156547 3438 288 183 18123 
GO:BP secretion by cell GO:0032940 2.45Е-57 56.6101168 1351 288 121 18123 
GO:BP myeloid leukocyte activation GO:0002274 2.49E-56 55.6037848 671 288 91 18123 
GO:BP leukocyte activation involved GO:0002366 1.21Е-55 54.9159717 723 288 93 18123 
іп immune response 
GO:BP cell activation involved in GO:0002263 2.01E-55 54.6970091 727 288 93 18123 
immune response 
GO:BP export from cell GO:0140352 3.88E-55 54.4107173 1413 288 121 18123 
GO:BP secretion GO:0046903 2.17E-54 53.6632118 1492 288 123 18123 
GO:BP leukocyte activation GO:0045321 5.97E-50 49.224289 1341 288 113 18123 
GO:BP humoral immune response GO:0006959 8.06E-45 44.0934814 383 288 65 18123 
GO:BP transport GO:0006810 1.18E-37 36.9299465 5214 288 192 18123 
GO:BP establishment of localization GO:0051234 3.98E-37 36.4006131 5364 288 194 18123 
GO:BP establishment of localization GO:0051649 1.71E-32 31.7672174 2778 288 133 18123 
in cell 
GO:BP antimicrobial humoral GO:0019730 1.95E-32 31.7096374 144 288 38 18123 
response 
GO:BP defense response GO:0006952 2.39E-31 30.6214787 1956 288 110 18123 
GO:BP biological process involved іп | GO:0044419 2.16E-29 28.6647252 1768 288 102 18123 
interspecies interaction 
between organisms 
GO:BP response to external stimulus GO:0009605 3.74E-29 28.4275491 3114 288 136 18123 
GO:BP Tesponse to stress GO:0006950 1.50E-28 27.8227604 4321 288 161 18123 
GO:BP response to other organism GO:0051707 2.10E-26 25.6773767 1600 288 93 18123 
GO:BP response to external biotic GO:0043207 2.21E-26 25.6564687 1601 288 93 18123 
stimulus 
GO:BP response to biotic stimulus GO:0009607 2.62Е-26 25.5818974 1639 288 94 18123 
GO:BP localization GO:0051179 4.11E-26 25.3866372 6938 288 204 18123 
GO:BP cellular localization GO:0051641 5.81E-26 25.2357235 3519 288 140 18123 
GO:BP phagocytosis GO:0006909 1.52Е-25 24.8191105 388 288 48 18123 
GO:BP defense response to other GO:0098542 2.48E-25 24.605158 1222 288 80 18123 
organism. 
СО:ВР defense response to bacterium | GO:0042742 1.58Е-24 23.8012777 349 288 45 18123 
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GO:BP peptide cross-linking GO:0018149 2.57E-20 19.5899207 96 288 25 18123 

GO:BP complement activation GO:0006956 4.71E-20 19.3269671 178 288 31 18123 

GO:BP response to stimulus GO:0050896 8.55Е-20 19.0680445 9487 288 231 18123 

GO:BP response to bacterium GO:0009617 1.37E-18 17.8630065 743 288 55 18123 

СО:ВР endocytosis а О:0006897 3.08E-18 17.5111988 700 288 53 18123 

GO:BP positive regulation of immune | GO:0050778 3.31E-18 17.4805084 757 288 55 18123 
response 

GO:BP sitive regulation of immune | GO:0002684 9.20E-18 17.0362858 1079 288 65 18123 

m process 

СО:ВР activation of immune response | СО:0002253 1.86E-17 16.7300364 566 288 47 18123 

GO:BP innate immune response GO:0045087 2.47Е-17 16.6068267 1003 288 62 18123 

GO:BP humoral immune response GO:0002455 2.59E-17 16.5863754 153 288 27 18123 
mediated by circulating 
immunoglobulin 

СО:ВР complement activation, а О:0006958 3.57Е-17 16.447622 140 288 26 18123 

GO:BP gamma receptor signaling GO:0038096 2.15E-16 15.6667281 135 288 25 18123 
pathway involved in 
phagocytosis 

СО:ВР immune response-regulating GO:0002433 2.15E-16 15.6667281 135 288 25 18123 
cell surface receptor signaling 
pathway involved in 
phagocytosis 

GO:BP receptor-mediated endocytosis | GO:0006898 2.31E-16 15.6367546 335 288 36 18123 

GO:BP Fc-gamma receptor signaling GO:0038094 3.14E-16 15.5032741 137 288 25 18123 
pathway 

GO:BP regulation of immune system GO:0002682 3.68E-16 15.4346711 1734 288 81 18123 
process 

GO:BP Fc receptor mediated GO:0002431 6.54E-16 15.1847034 141 288 25 18123 
stimulatory signaling pathway 

СО:ВР platelet degranulation GO:0002576 7.30E-16 15.1368364 127 288 24 18123 

СО:ВР regulation of immune GO:0050776 1.75Е-15 14.7569082. 1159 288 64 18123 
response 

GO:BP phagocytosis, recognition GO:0006910 2.27E-15 14.6440186 133 288 24 18123 

СО:ВР lymphocyte mediated GO:0002449 2.51Е-15 14.6007503 360 288 36 18123 
immunity 

GO:BP cornification GO:0070268 9.78E-15 14.0094718 112 288 22 18123 

GO:BP immune response-activating GO:0002429 1.21Е-14 13.9186618 475 288 40 18123 
cell surface receptor signaling 
pathway 

GO:BP immune response-activating GO:0002757 1.21Е-14 13.9186618 475 288 40 18123 
signal transduction 

GO:BP immune response-regulating GO:0002768 2.35E-14 13.6292597 510 288 4l 18123 
cell surface receptor signaling 
pathway 

GO:BP immune response-regulating GO:0002764 2.89E-14 13.5383815 513. 288 4l 18123 
signaling pathway 

СО:ВР retina homeostasis GO:0001895 3.44E-14 13.4636898 79 288 19 18123 

GO:BP immunoglobulin mediated GO:0016064 4.56E-14 13.3413924 221 288 28 18123 
immune response 

GO:BP B cell mediated immunity GO:0019724 6.53E-14 13.1849054 224 288 28 18123 

СО:ВР negative regulation of GO:0010466 7.45Е-14 13.1280982 265 288 30 18123 
peptidase activity 

GO:BP regulation of complement GO:0030449 1.07E-13 12.9708018 110 288 21 18123 
activation 

GO:BP antibacterial humoral response | GO:0019731 1.17E-13 12.9304088 61 288 17 18123 

GO:BP leukocyte migration GO:0050900 1.54E-13 12.8114101 51 288 40 18123 

GO:BP negative regulation of GO:0010951 1.66E-13 12.7807013 252 288 29 18123 
endopeptidase activity 

GO:BP regulation of response to GO:0032101 1.94E-13 12.7129891 1238 288 63 18123 
external stimulus 

GO:BP cell killing GO:0001906 2.14E-13 12.6692194 196 288 26 18123 

GO:BP regulation of peptidase GO:0052547 2.77E-13 12.557545 467 288 38 18123 
activity 

GO:BP Fc receptor signaling pathway | GO:0038093 3.26E-13 12.4863695 238 288 28 18123 
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GO:BP regulation of response to GO:0080134 3.44E-13 12.4635056 1626 288 73 18123 
str 
GO:BP positive regulation of response | GO:0048584 3.77E-13 12.4240307 2368 288 91 18123 
to stimulus 
GO:BP regulation of humoral immune | GO:0002920 5.44E-13 12.2645441 134 288 22 18123 
response 
GO:BP negative regulation of GO:0045861 7.42Е-13 12.129755 356 288 33 18123 
proteolysi 
GO:BP cell recognition GO:0008037 1.18Е-12 11.9282087 271 288 29 18123 
СО:ВР adaptive immune response GO:0002460 1.68Е-12 11.7746189 366 288 33 18123 
based оп somatic 
recombination of immune 
receptors built from 
immunoglobulin superfamily 
domains 
GO:BP programmed cell death GO:0012501 4.28Е-12 11.3688604 2158 288 84 18123 
GO:BP negative regulation of GO:0032102 7.28E-12 11.1376621 601 288 4l 18123 
response to external stimulus. 
GO:BP regulation of endopeptidase GO:0052548 7.98E-12 11.098218 437 288 35 18123 
activity 
GO:BP killing of cells of other GO:0031640 8.54E-12 11.0685313 77 288 T7 18123 
organism 
GO:BP | regulation of proteolysis С0:0030162 | 9.00E-12 11.0459398 756 288 46 18123 
GO:BP cell death GO:0008219 2.16E-11 10.6648525 2306 288 86 18123 
GO:BP membrane organization GO:0061024 3.21E-11 10.4935133 1016 288 53 18123 
GO:BP | locomotion GO:0040011 | 5.25E-11 10.2800628 2039 |288 79 18123 
GO:BP regulation of biological GO:0065008 5.97E-11 10.2237029 4116 288 123 18123 
quality 
GO:BP cell motility GO:0048870 6.91E-11 10.1603288 1840 288 74 18123 
GO:BP | localization of cell G0:0051674 | 6.91E-11 10.1603288 1840 | 288 74 18123 
GO:BP proteolysis GO:0006508 9.40E-11 10.0267583 1851 288 74 18123 
GO:BP hemostasis GO:0007599 2.12E-10 9.67405228 355 288 30 18123 
GO:BP biological process involved їп | GO:0044403 2.24Е-10 9.64989467 331 288 29 18123 
symbiotic interaction 
GO:BP movement of cell or GO:0006928 2.63E-10 9.58000022 2319 288 84 18123 
subcellular component 
GO:BP opsonization GO:0008228 4.75Е-10 9.32345989 69 288 15 18123 
GO:BP keratinization GO:0031424 4.76Е-10 9.32273516 225 288 24 18123 
GO:BP | adaptive immune response GO:0002250 | 5.37E-10 9,26986814 715 288 42 18123 
GO:BP regulation of immune effector | GO:0002697 5.62E-10 9.25012504 421 288 32 18123 
GO:BP nvolvedin | GO:0051702 7.02E-10 9.15337235 99 288 17 18123 
interaction with symbiont 
GO:BP negative regulation of GO:0051346 9.13E-10 9.03960435 456 288 33 18123 
hydrolase activity. 
GO:BP blood coagulation GO:0007596 9.97E-10 9.00121333 351 288 29 18123 
GO:BP supramolecular fiber GO:0097435 1.10Е-09 8.96026071 863 288 46 18123 
organization 
GO:BP cell surface receptor signaling | GO:0007166 1.25E-09 8.90392992 3226 288 102 18123 
pathway 
GO:BP keratinocyte differentiation GO:0030216 1.41E-09 8.84929668 306 288 27 18123 
GO:BP coagulation GO:0050817 1.64E-09 8.78497833 358 288 29 18123 
GO:BP wound healing GO:0042060 1.87E-09 8.72731318 555 288 36 18123 
GO:BP response to wounding GO:0009611 2.05E-09 8.68879389 680 288 40 18123 
GO:BP phagocytosis, engulfment GO:0006911 3.20E-09 8.49482689 125 288 18 18123 
GO:BP defense response to fungus GO:0050832 3.73E-09 8.42820974 42 288 12 18123 
GO:BP tissue homeostasis GO:0001894 4.29E-09 8.36721603 272 288 25 18123 
GO:BP cell migration GO:0016477 4.34E-09 8.36256424 1660 288 66 18123 
GO:BP antimicrobial humoral GO:0061844 5.75E-09 8.24026335 81 288 15 18123 
immune response mediated by 
antimicrobial peptide 
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GO:BP negative regulation of blood GO:0030195 6.29E-09 8.20161699 55 288 13 18123 
coagulation 

GO:BP negative regulation of GO:1900047 8.07E-09 8.09305825 56 288 13 18123 
hemostasis 

GO:BP fibrinolysis GO:0042730 9.33E-09 8.03032445 26 288 10 18123 

GO:BP regulation of blood GO:0030193 1.04E-08 7.9826602 70 288 14 18123 
coagulation 

GO:BP plasma membrane GO:0099024 1.08E-08 7.96770377 134 288 18 18123 
invagination 

GO:BP regulation of hemostasis GO:1900046 1.56Е-08 7.80571485 72 288 14 18123 

СО:ВР homotypic cell-cell adhesion GO:0034109 1.71E-08 7.7678501 87 288 15 18123 

GO:BP positive regulation of GO:0048518 2.15E-08 7.66680788 6474 288 159 18123 
biological process 

СО:ВР organonitrogen compound GO:1901564 2.58E-08 7.58776195 6904 288 166 18123 
metabolic process 

СО:ВР negative regulation of GO:0050819 2.61E-08 7.58298592 61 288 13 18123 
coagulation 

GO:BP anatomical structure а О:0060249 2.63Е-08 7.57940677 486 288 32 18123 
homeostasis 

GO:BP membrane invagination GO:0010324 2.93E-08 7.53333831 142 288 18 18123 

GO:BP regulation of body fluid levels | GO:0050878 3.13E-08 7.50472809 519 288 33 18123 

GO:BP killing by host of symbiont GO:0051873 3.37E-08 7.47216481 29 288 10 18123 
cells 

GO:BP regulation of coagulation GO:0050818 4.10E-08 7.38771455 7] 288 14 18123 

GO:BP | regulation of defense response | GO:0031347 | 4.49E-08 734799289 889 288 44 18123 

GO:BP inflammatory response GO:0006954 4.72Е-08 7.3259117 962 288 46 18123 

СО:ВР platelet aggregation GO:0070527 5.01E-08 7.30028873 64 288 13 18123 

GO:BP protein metabolic process GO:0019538 6.41E-08 7.19286818 5912 288 148 18123 

GO:BP killing of cells in other GO:0051883 7.26E-08 7.13911902 31 288 10 18123 
organism involved in 
symbiotic interaction 

GO:BP epidermal cell differentiation GO:0009913 1.19E-07 6.92453132 370 288 27 18123 

СО:ВР теѕропѕе to fungus GO:0009620 1.55E-07 6.81097891 56 288 12 18123 

GO:BP | platelet activation GO:0030168 | 3.02Е-07 6.51970596 163 288 18 18123 

GO:BP regulation of response to GO:0048583 3.24E-07 6.48936053 4416 288 119 18123 
stimulus 

СО:ВР negative regulation of GO:0043086 3.79E-07 6.42150824 841 288 4l 18123 
catalytic activity: 

GO:BP negative regulation of immune | GO:0002683 6.20E-07 6.20729825 614 288 34 18123 
system process 

GO:BP multicellular organismal GO:0032501 1.1311E-06 5.94650686 8000 288 179 18123 
process 

СО:ВР negative regulation of defense | С0:003 1348 1.2147E-06 5.91552871 440 288 28 18123 
response 

GO:BP regulation of inflammatory GO:0050727 1.2456E-06 5.90461926 565 288 32 18123 
response 

СО:ВР negative regulation of wound GO:0061045 1.5404E-06 5.81235425 83 288 13 18123 
healing 

СО:ВР теѕропѕе to chemical GO:0042221 2.5387E-06 5.59539256 4829 288 124 18123 

СО:ВР skin development GO:0043588 2.7553E-06 5.55983692 426 288 27 18123 

GO:BP negative regulation of GO:0048585 3.5193E-06 5.45354175 1938 288 66 18123 
response to stimulus 

GO:BP negative regulation of immune | GO:0050777 3.7643E-06 5.42431642 373 288 25 18123 
response 

СО:ВР cytoskeleton organization GO:0007010 4.678E-06 5.32993969 1473 288 55 18123 

СО:ВР cellular response to cytokine GO:0071345 4.9139E-06 5.30857303 1148 288 47 18123 
stimulus 

GO:BP glucose catabolic process to GO:0061718 5.443E-06 5.26415965 24 288 8 18123 
pyruvate 

GO:BP NADH regeneration GO:0006735 5.443E-06 5.26415965 24 288 8 18123 

СО:ВР canonical glycolysis GO:0061621 5.443E-06 5.26415965 24 288 8 18123 

GO:BP regulation of response to GO:1903034 5.6385E-06 5.24883869 172 288 17 18123 
wounding 

GO:BP regulation of localization GO:0032879 5.9658E-06 5.22433385 2906 288 86 18123 
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GO:BP epidermis development GO:0008544 7.7423Е-06 5.1111281 478 288 28 18123 

GO:BP negative regulation of GO:0050728 8.5418E-06 5.06844923 359 288 24 18123 
inflammatory response 

GO:BP inflammatory response to GO:0002437 9.4411E-06 5.02497775 251 288 20 18123 
antigenic stimulus 

GO:BP biological adhesion GO:0022610 1.1053E-05 4.95650677 1509 288 55 18123 

СО:ВР glycolytic process through GO:0061620 1.1247E-05 4.9489588 26 288 8 18123 
glucose-6-phosphate 

GO:BP cellular protein metabolic GO:0044267 1.213E-05 4.91615488 5280 288 130 18123 
process 

GO:BP response to oxidative stress GO:0006979 1.2536E-05 4.90185358 457 288 27 18123 

GO:BP protein folding GO:0006457 1.2939E-05 4.88810859 230 288 19 18123 

GO:BP regulation of protein GO:0051246 1.3662E-05 4.86449507 2759 288 82 18123 
metabolic process 

GO:BP regulation of wound healing GO:0061041 1.3673E-05 4.86415129 138 288 15 18123 

GO:BP negative regulation of GO:0048519 1.3965E-05 4.85495218 5861 288 140 18123 
biological process 

GO:BP negative regulation of GO:1903035 1.4361E-05 4.84281669 99 288 13 18123 
response to wounding 

GO:BP glycolytic process through GO:0061615 1.5764E-05 4.80234299 27 288 8 18123 
fructose-6-phosphate 

СО:ВР теѕропѕе to cytokine GO:0034097 2.064E-05 4.68529864 1242 288 48 18123 

GO:BP blood coagulation, fibrin clot GO:0072378 2.1767E-05 4.66220849 28 288 8 18123 
formation 

GO:BP protein activation cascade GO:0072376 2.1767E-05 4.66220849 28 288 8 18123 

GO:BP cellular component GO:0016043 2.1888E-05 4.6598009 6661 288 153 18123 
organization 

GO:BP regulation of multicellular GO:0051239 2.2835E-05 4.64139565 2888 288 84 18123 
organismal pro 

GO:BP negative regulation of protein GO:0051248 3.0219E-05 4.51972279 1216 288 47 18123 
metabolic process 

GO:BP defense response to Gram- GO:0050829 3.6158E-05 4.44179494 88 288 12 18123 
negative bacterium 

GO:BP regulation of inflammatory GO:0002861 3.8421E-05 4.41543145 220 288 18 18123 

onse to antigenic stimulus 

GO:BP sponse to inorganic GO:0010035 4.0687E-05 4.39054214 583 288 30 18123 
substance 

GO:BP positive regulation of cellular GO:0051130 4.3066E-05 4.36586201 1189 288 46 18123 
component organization 

GO:BP regulation of cellular protein GO:0032268 4.8778E-05 4.3117751 2591 288 in 18123 
metabolic process, 

GO:BP epithelial cell differentiation GO:0030855 5.6427E-05 4.24851395 807 288 36 18123 

GO:BP regulation of cell activation GO:0050865 5.7599E-05 4.23958379 662 288 32 18123 

GO:BP response to organic substance GO:0010033 6.1088E-05 4.21404553 3457 288 94 18123 

GO:BP actin filament-based process GO:0030029 6.2008E-05 4.2075508 810 288 36 18123 

GO:BP cell adhesion GO:0007155 6.657E-05 4.17671946 1502 288 53 18123 

GO:BP multicellular organismal GO:0048871 8.1511E-05 4.08878377 533 288 28 18123 
homeostasis 

GO:BP B cell receptor signaling GO:0050853 8.1582Е-05 4.08840479 135 288 14 18123 
pathway 

GO:BP negative regulation of GO:0002862 8.5682E-05 4.06710831 206 288 17 18123 
inflammatory response to 
antigenic stimulus 

GO:BP cell-cell adhesion GO:0098609 8.9323E-05 4.04903442 898 288 38 18123 

GO:BP glucose catabolic process GO:0006007 9.0753E-05 4.04214029 33 288 8 18123 

GO:BP negative regulation of cellular | GO:0032269 0.00010147 3.99365808 1141 288 44 18123 
protein metabolic process 

GO:BP cellular response to chemical GO:0070887 0.00010521 3.97796099 3443 288 93 18123 
stimulus 

GO:BP negative regulation of GO:0044092 0.00011364 3.94447939 1228 288 46 18123 
molecular function 

GO:BP regulation of catalytic activity | GO:0050790 0.00012203 3.9135443 2452 288 73 18123 

GO:BP respiratory burst GO:0045730 0.00014966 3.82490698 35 288 8 18123 

GO:BP pentose biosynthetic process GO:0019322 0.00015619 3.80635128 4 288 4 18123 

GO:BP plasminogen activation GO:0031639 0.00016335 3.7868718 24 288 7 18123 
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GO:BP intermediate filament GO:0045109 0.00016335 3.7868718 24 288 7 18123 
organization 

GO:BP | NAD metabolic process GO:0019674 | 0.00016792 | 377490567 49 288 9 18123 

СО:ВР Fc-epsilon receptor signaling GO:0038095 0.00018229 3.7392477 167 288 15 18123 
pathway 

GO:BP amyloid fibril formation GO:1990000 0.0001952 3.70952689 102 288 12 18123 

СО:ВР regulation of hydrolase GO:0051336 0.00020144 3.69584372 1252 288 46 18123 
activity 

GO:BP intermediate filament GO:0045104 0.00020195 3.69475786 50 288 9 18123 
cytoskeleton organization 

GO:BP Signal transduction GO:0007165 0.00021291 3.67179963 6447 288 146 18123 

СО:ВР protein stabilization GO:0050821 0.00023647 3.62621927 195 288 16 18123 

GO:BP intermediate filament-based GO:0045103 0.00024184 3.61647467 51 288 9: 18123 
process 

СО:ВР cellular component GO:0071840 0.00025093 3.60044527 6880 288 153 18123 
organization or biogenesis 

GO:BP modulation of process of other | GO:0035821 0.00028488 3.5453316 149 288 14 18123 
organism. 

GO:BP glucose 6-phosphate GO:0051156 0.00030212 3.51981757 26 288 7 18123 
metabolic process 

СО:ВР actin cytoskeleton GO:0030036 0.00030806 3.51136134 712 288 32 18123 
organization. 

СО:ВР positive regulation of B cell GO:0050871 0.00033663 3.47284151 151 288 14 18123 
activation 

СО:ВР mucosal immune response GO:0002385 0.00037022 3.43153671 39 288 8 18123 

GO:BP regulation of biological GO:0050789 0.00037878 3.42161678 12122 | 288 232 18123 
process 

GO:BP detection of chemical stimulus | GO:0001580 0.00045645 3.34061041 40 288 8 18123 
involved in sensory perception 
of bitter taste. 

СО:ВР actin nucleation GO:0045010 0.00047811 3.32047475 55 288 9 18123 

GO:BP positive regulation of gene GO:0010628 0.00049025 3.30958618 1206 288 44 18123 
expression 

GO:BP actin filament organization GO:0007015 0.00050761 3.29446592 445 288 24 18123 

GO:BP innate immune response in GO:0002227 0.00052921 3.27637378 28 288 7 18123 
mucosa 

GO:BP negative regulation of GO:0051918 0.00057783 3.23819827 10 288 5 18123 
fibrinolysis 

GO:BP NADPH regeneration GO:0006740 0.00060433 3.21872742 18 288 6 18123 

GO:BP zymogen activation GO:0031638 0.00076803 3.11462035 58 288 9 18123 

GO:BP | NADH metabolic process GO:0006734 | 0.00082582 | 3.08311583 43 288 8 18123 

СО:ВР organ ог tissue specific GO:0002251 0.00082582 3.08311583 43 288 8 18123 
immune response 

GO:BP | regulation of protein stability | GO:0031647 | 0.00098731 | 3.00554685 302 288 19 18123 

GO:BP sensory perception of bitter GO:0050913 0.00099553 3.001946 44 288 8 18123 
taste 

СО:ВР regulation of apoptotic GO:2001233 0.00105445 2.97697483 365 288 21 18123 
signaling pathway 

СО:ВР positive regulation of GO:0051050 0.00111565 2.95247202 951 288 37 18123 
transport 

GO:BP hydrogen peroxide catabolic GO:0042744 0.00112826 2.94758955 31 288 7 18123 
process 

GO:BP | cellular oxidant detoxification | GO:0098869 | 0.00123124 | 2.90965828 99 288 П 18123 

СО:ВР regulation of programmed сей | GO:0043067 0.00131897 2.87976543 1603 288 52 18123 
death 

GO:BP detection of chemical stimulus | GO:0050912 0.00142587 2.84591914 46 288 8 18123 
involved in sensory perception 
of taste 

GO:BP regulation of transport GO:0051049 0.00148793 2.82741805 1839 288 57 18123 

GO:BP | regulation of B cell activation | GO:0050864 | 0.00156999 | 2.80410279 197 288 15 18123 

СО:ВР regulation of cell GO:0022604 0.00162028 2.79041049 312 288 19 18123 
morphogenesis 

GO:BP regulation of anatomical GO:0022603 0.00162118 2.79016965 1048 288 39 18123 
structure morphogenesis 

GO:BP homeostatic process GO:0042592 0.00163013 2.78777741 1985 288 60 18123 

GO:BP regulation of leukocyte GO:0002694 0.00164373 2.78417027 617 288 28 18123 
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GO:BP regulation of cytokine GO:0001817 0.00169109 2.77183405 847 288 34 18123 
production 

GO:BP interleukin-12-mediated GO:0035722 0.00169487 2.77086368 47 288 8 18123 
signaling pathway: 

GO:BP cellular response to stimulus GO:0051716 0.00185323 2.73207082 7873 288 166 18123 

GO:BP extracellular matrix GO:0022617 0.00192208 2.71622794 83 288 10 18123 
disassembly 

GO:BP | biological regulation GO:0065007 | 0.00196686 | 2.70622669 12802 | 288 239 18123 

GO:BP regulation of molecular GO:0065009 0.00201045 2.6967061 3191 288 84 18123 
function 

GO:BP cytokine production GO:0001816 0.00203737 2.69093083 854 288 34 18123 

GO:BP regulation of cellular GO:0051128 0.00221247 2.65512338 2439 288 69 18123 
component organization 

GO:BP positive regulation of response | GO:0032103 0.00222278 2.65310297 518 288 25 18123 
to external stimulus 

СО:ВР | regulation of cellular process | GO:0050794 | 0.00223691 | 2.65035127 11517 | 288 221 18123 

GO:BP induction of bacterial GO:0043152 0.00228445 2.64121774 6 288 4 18123 
agglutination 

GO:BP cellular response to GO:0071349 0.0023647 2.62622324 49 288 8 18123 
interleukin-12 

GO:BP positive regulation of GO:0010638 0.00248161 2.60526657 630 288 28 18123 
organelle organization 

GO:BP regulation of lymphocyte GO:0051249 0.00264694 2.57725581 523 288 25 18123 
activation 

GO:BP sensory perception of taste GO:0050909 0.00272159 2.5651775 67 288 9 18123 

GO:BP | response to interleukin-12 GO:0070671 | 0.00277666 | 2.55647774 50 288 8 18123 

GO:BP reactive oxygen species GO:0072593 0.00279685 2.55333142 234 288 16 18123 
metabolic process 

GO:BP regulation of cell death GO:0010941 0.00303181 2.51829753 1740 288 54 18123 

GO:BP chaperone-mediated protein GO:0051131 0.00307345 2.51237392 23 288 6 18123 
complex assembly 

GO:BP protein refolding GO:0042026 0.00307345 2.51237392 23 288 6 18123 

GO:BP regulation of fibrinolysis GO:0051917 0.00435667 2.36084566 14 288 5 18123 

GO:BP cellular detoxification GO:1990748 0.00466545 2.33110664 113 288 11 18123 

GO:BP positive regulation of GO:1902905 0.00472997 2.32514118 215 288 15 18123 
supramolecular fiber 
organization 

GO:BP hexose catabolic process GO:0019320 0.00509309 2.29301827 54 288 8 18123 

GO:BP regulation of vesicle-mediated | GO:0060627 0.00537217 2.26985005 544 288 25 18123 
transport 

GO:BP Arp2/3 complex-mediated GO:0034314 0.0059192 222773718 39 288 7 18123 
actin nucleation 

GO:BP cell communication GO:0007154 0.00634816 2.19735247 6961 288 149 18123 

GO:BP positive regulation of response | GO:1903036 0.00638403 2.19490477 74 288 9 18123 
to wounding 

GO:BP Iymphocyte activation GO:0046649 0.00669968 2.17394584 780 288 31 18123 

GO:BP hydrogen peroxide metabolic GO:0042743 0.00676476 2.16974749 56 288 8 18123 
process 

GO:BP cellular response to organic GO:0071310 0.00693718 2.15881718 2821 288 75 18123 
substance 

GO:BP response to metal ion GO:0010038 0.00697248 2.15661258 377 288 20 18123 

GO:BP positive regulation of protein GO:0032273 0.00715303 2.14550971 142 288 12 18123 
polymerization. 

GO:BP regulation of cell-substrate GO:0010810 0.00743906 2.12848194 223 288 15 18123 
adhesion 

GO:BP tissue migration GO:0090130 0.00786055 2.10454724 380 288 20 18123 

GO:BP response to reactive oxygen GO:0000302 0.00786065 2.1045418 224 288 15 18123 
spec 

GO:BP positive regulation of GO:0002696 0.00792838 2.10081542 414 288 21 18123 
leukocyte activation 

GO:BP negative regulation of GO:0043069 0.00844185 2.07356259 952 288 35 18123 
programmed cell death 

GO:BP e regulation of cellular GO:0048522 0.00851111 2.07001364 5863 288 130 18123 
process 

GO:BP signaling GO:0023052 0.00891812 2.04972647 6937 288 148 18123 
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GO:BP cellular response to toxic GO:0097237 0.00915809 2.03819504 121 288 11 18123 
substance 

СО:ВР | regulation of respiratory burst | GO:0060263 | 0.00926038 | 2.03337097 16 288 5 18123 

GO:BP pentose-phosphate shunt GO:0006098 0.00926038 2.03337097 16 288 5 18123 

GO:BP | apoptotic process GO:0006915 | 0.00948456 | 2.02298278 1998 | 288 58 18123 

СО:ВР monosaccharide biosynthetic GO:0046364 0.00984647 2.00671945 99 288 10 18123 
process 

GO:BP sitive regulation of GO:1900026 0.00996774 2.00140336 42 288 7 18123 

te adhesion-dependent 
preading 

GO:BP negative regulation of GO:0051241 0.01026828 1.9885025 1174 288 40 18123 
multicellular organismal 
process 

GO:BP external encapsulating GO:0045229 0.01067889 1.97147394 422 288 21 18123 
Structure organization 

GO:BP monosaccharide catabolic GO:0046365 0.01153315 1.93805206 60 288 8 18123 
process 

GO:BP | cell redox homeostasis GO:0045454 | 0.01174533 | 193013462 43 288 7 18123 

GO:BP GO:0050867 0.01235657 1.90810218 426 288 21 18123 

GO:BP positive regulation of cell- GO:0010811 0.01257533 1.90048077 125 288 11 18123 
substrate adhesion 

GO:BP negative regulation of cellular. | GO:0048523 0.01268534 1.89669779 5153 288 117 18123 
process 

GO:BP negative regulation of GO:0043066 0.01270119 1.89615561 928 288 34 18123 
apoptotic process 

СО:ВР apoptotic signaling pathway GO:0097190 0.01274108 1.89479359 609 288 26 18123 

GO:BP | anatomical structure GO:0009653 | 0.01274496 | 1.89466146 2816 | 288 74 18123 
morphogen 

GO:BP | positive regulation of actin GO:0030838 | 0.01291088 | 1.88904409 102 288 10 18123 
filament polymerization 

СО:ВР | cell junction organization GO:0034330 | 0.013376 187367362 727 288 29 18123 

GO:BP positive regulation of protein- GO:0031334 0.01402912 1.85296945 265 288 16 18123 
containing complex assembly 

СО:ВР post-translational protein GO:0043687 0.01448382 1.83911678 362 288 19 18123 
modification 

СО:ВР anatomical structure GO:0048856 0.01601036 1.79559892 6164 288 134 18123 
development 

GO:BP regulation of apoptotic GO:0042981 0.01672779 1.77656138 1560 288 48 18123 
proce: 

GO:BP positive regulation of GO:0051251 0.01694871 1.77086323 366 288 19 18123 
lymphocyte activation 

GO:BP | angiogenesis GO:0001525 | 0.01802467 | 1.74413274 621 288 26 18123 

GO:BP epithelium development GO:0060429 0.01880272 1.72577925 1338 288 43 18123 

СО:ВР | tissue development GO:0009888 | 0.01915473 | 171772385 203 |288 59 18123 

GO:BP regulation of actin filament- GO:0032970 0.01959768 1.70779538 404 288 20 18123 
based process 

СО:ВР cytokine-mediated signaling GO:0019221 0.02029365 1.69263978 824 288 31 18123 
pathway 

GO:BP epithelial cell migration GO:0010631 0.02055889 1.68700029 371 288 19 18123 

GO:BP protein polymerization GO:0051258 0.02254642 1.6469224 307 288 17 18123 

СО:ВР epithelium migration GO:0090132 0.02304407 1.63744085 374 288 19 18123 

GO:BP protein-containing complex GO:0065003 0.02465889 1.60802656 1724 288 51 18123 
assembly 

GO:BP cell chemotaxis GO:0060326 0.02560235 1.59172022 310 288 17 18123 

GO:BP regulation of establishment of | GO:0070201 0.02561607 1.59148748 595 288 25 18123 
protein localization 

GO:BP ephrin receptor signaling GO:0048013 0.02718343 1.56569571 88 288 9 18123 
pathway 

СО:ВР regulation of cell adhesion GO:0030155 0.02813063 1.55082056 756 288 29 18123 

GO:BP response to cadmium ion GO:0046686 0.02985421 1.52499448 68 288 8 18123 

GO:BP establishment or maintenance GO:0007163 0.03001834 1.52261329 220 288 14 18123 
of cell polarity 

GO:BP myeloid leukocyte migration GO:0097529 0.03001834 1.52261329 220 288 14 18123 
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GO:BP regulation of actin GO:0008064 0.03010199 1.52140479 191 288 T3 18123 
polymerization or 
depolymerization 

GO:BP neutrophil-mediated killing of | GO:0070943 0.03041074 1.51697298 10 288 4 18123 
symbiont cell 

GO:BP actin polymerization or GO:0008154 0.0315984 1.50033491 221 288 14 18123 
depolymerization 

GO:BP regulation of actin filament GO:0030832 0.0318358 1.49708422 192 288 13 18123 
length 

GO:BP leukocyt GO:0001909 0.03228312 1.49102446 113 288 10 18123 
cytotoxicity 

GO:BP extracellular matrix GO:0030198 0.03338679 1.47642539 419 288 20 18123 
organization 

GO:BP extracellular structure GO:0043062 0.03456012 1.46142479 420 288 20 18123 
organization 

GO:BP response to toxic substance GO:0009636 0.03585527 1.44544698 254 288 15 18123 

GO:BP regulation of protein- GO:0043254 0.03713534 1.43021263 458 288 21 18123 
containing complex assembly 

СО:ВР acute inflammatory response GO:0002526 0.03769252 1.42374485 115 288 10 18123 

GO:BP regulation of intrinsic GO:2001242 0.03790212 1.42133646 167 288 12 18123 
apoptotic signaling pathway 

GO:BP system development GO:0048731 0.03847193 1.41485605 5095 288 114 18123 

GO:BP ameboidal-type cell migration | GO:0001667 0.03878637 1.41132081 496 288 22 18123 

GO:BP response to ethanol GO:0045471 0.03985333 1.39953535 141 288 11 18123 

GO:BP antigen receptor-mediated GO:0050851 0.0402315 1.39543375 321 288 17 18123 
signaling pathway 

GO:BP detoxification GO:0098754 0.0455075 1.34191703 143 288 11 18123 

GO:BP positive regulation of GO:0048260 0.04905683 1.30930056 53 288 7 18123 
receptor-mediated endocytosis 

GO:BP NADP metabolic process GO:0006739 0.04985034 1.3023319 36 288 6 18123 

GO:CC extracellular exosome GO:0070062 3.43E-166 165.464669 2178 291 235 18964 

GO:CC extracellular vesicle GO:1903561 3.48E-162 161.458778 2263 291 235 18964 

GO:CC extracellular membrane- GO:0065010 4.30E-162 161.366514 2265 291 235 18964 
bounded organelle 

GO:CC extracellular organelle GO:0043230 4.30E-162 161.366514 2265 291 235 18964 

GO:CC extracellular space GO:0005615 8.67E-157 156.061801 3595 291 264 18964 

GO:CC extracellular region GO:0005576 2.51E-145 144.600098 4564 291 274 18964 

GO:CC vesicle GO:0031982 2.30E-126 125.638439 4058 291 253 18964 

GO:CC secretory granule lumen GO:0034774 8.91E-79 78.0501465 320 291 84 18964 

GO:CC cytoplasmic vesicle lumen GO:0060205 2.85E-78 77.5451298 324 291 84 18964 

GO:CC vesicle lumen GO:0031983 5.06E-78 77.2954413 326 291 84 18964 

GO:CC secretory granule GO:0030141 2.80E-72 71.5534479 857 291 112 18964 

GO:CC secretory vesicle GO:0099503 1.22Е-63 62.9139476 1027 291 112 18964 

GO:CC cytoplasmic vesicle GO:0031410 1.80E-49 48.7437679 2433 291 143 18964 

GO:CC intracellular vesicle GO:0097708 2.33E-49 48.6317521 2438 291 143 18964 

GO:CC blood microparticle GO:0072562 5.23E-42 41.2811424 139 291 43 18964 

GO:CC ficolin-1-rich granule lumen GO:1904813 3.16E-38 37.4999676 124 291 39 18964 

GO:CC ficolin-1-rich granule GO:0101002 7.98Е-35 34.0978114 184 291 42 18964 

GO:CC azurophil granule lumen GO:0035578 2.90E-22 21.5371026 91 291 25 18964 

GO:CC endomembrane system GO:0012505 3.76Е-22 21.4242576 4651 291 152 18964 

GO:CC primary lysosome GO:0005766 5.82E-21 20.2350713 154 291 29 18964 

GO:CC azurophil granule GO:0042582 5.82E-21 20.2350713 154 291 29 18964 

GO:CC collagen-containing GO:0062023 1.04E-20 19.9839159 421 291 43 18964 
extracellular matrix 

GO:CC immunoglobulin complex GO:0019814 4.53E-20 19.3439023 165 291 29 18964 

GO:CC vacuolar lumen GO:0005775 2.16E-19 18.6655233 174 291 29 18964 
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GO:CC extracellular matrix GO:0031012 2.70E-18 17.5682762 562 291 46 18964 
GO:CC external encapsulating GO:0030312 2.90E-18 17.5369303 563 291 46 18964 
structure 
GO:CC focal adhesion GO:0005925 5.10E-18 17.2928492 420 291 40 18964 
GO:CC cell-substrate junction GO:0030055 9.29E-18 17.0318551 427 291 40 18964 
GO:CC tertiary granule lumen GO:1904724 3.23Е-17 16.4910719 55 291 18 18964 
GO:CC membrane-bounded organelle GO:0043227 1.46Е-16 15.8358633 13201 291 265 18964 
GO:CC anchoring junction GO:0070161 6.35E-15 14.1974039 838 291 51 18964 
GO:CC lysosome GO:0005764 1.06E-14 13.9753658 723 291 47 18964 
GO:CC lytic vacuole GO:0000323 1.06E-14 13.9753658 723 291 47 18964 
GO:CC melanosome GO:0042470 8.17E-14 13.0879763 109 291 20 18964 
GO:CC pigment granule GO:0048770 8.17E-14 13.0879763 109 291 20 18964 
GO:CC cell surface GO:0009986 1.00E-13 12.9984142 896 291 51 18964 
GO:CC tertiary granule GO:0070820 1.89E-13 12.722488 163 291 23 18964 
GO:CC specific granule lumen GO:0035580 1.94E-13 12.7124678 61 291 16 18964 
GO:CC vacuole GO:0005773 2.62Е-13 12.5816446 818 291 48 18964 
GO:CC cell periphery GO:0071944 3.47E-12 11.4599687 6178 291 158 18964 
GO:CC immunoglobulin complex, GO:0042571 6.74E-12 11.1711964 75 291 16 18964 
circulating 
GO:CC organelle GO:0043226 8.46Е-12 11.0726524 14027 | 291 266 18964 
GO:CC side of membrane GO:0098552 2.16E-11 10.6653458 622 291 39 18964 
GO:CC specific granule GO:0042581 1.28E-10 9.89430479 158 291 20 18964 
GO:CC actin cytoskeleton GO:0015629 2.59E-10 9.58694898 517 291 34 18964 
GO:CC platelet alpha granule lumen GO:0031093 3.16E-10 9.50022942 66 291 14 18964 
GO:CC endoplasmic reticulum lumen GO:0005788 8.18E-10 9.08740858 313 291 26 18964 
GO:CC platelet alpha granule GO:0031091 1.99E-09 8.70096483 90 291 15 18964 
GO:CC membrane-enclosed lumen GO:0031974 3.02E-09 8.52057389 5579 291 140 18964 
GO:CC intracellular organelle lumen GO:0070013 3.02E-09 8.52057389 5579 291 140 18964 
GO:CC organelle lumen GO:0043233 3.02E-09 8.52057389 5579 291 140 18964 
GO:CC cytoskeleton GO:0005856 3.83E-09 8.41664118 2339 291 78 18964 
GO:CC endocytic vesicle lumen GO:0071682 4.95E-09 8.30565005 22 291 9 18964 
GO:CC external side of plasma GO:0009897 1.18Е-08 7.9282247 409 291 28 18964 
membrane 
GO:CC cornified envelope GO:0001533 1.28Е-08 7.89327475 44 291 11 18964 
GO:CC Arp2/3 protein complex GO:0005885 1.66E-08 7.78030014 11 291 7 18964 
GO:CC cytosol GO:0005829 1.82E-08 7.74080428 5361 291 134 18964 
GO:CC polymeric cytoskeletal fiber GO:0099513 5.10E-08 7.29270151 766 291 38 18964 
GO:CC supramolecular fiber GO:0099512 6.89E-08 7.1620648 997 291 44 18964 
GO:CC supramolecular polymer GO:0099081 8.86E-08 7.05250542 1005 291 44 18964 
GO:CC endocytic vesicle GO:0030139 1.24E-07 6.9065204 335 291 24 18964 
GO:CC intermediate filament GO:0045111 1.83E-07 6.73692411 260 291 21 18964 
cytoskeleton 
GO:CC IgA immunoglobulin GO:0071746 2.34E-07 6.63150015 5 291 5 18964 
complex, circulating 
GO:CC IgA immunoglobulin complex | GO:0071745 2.34E-07 6.63150015 5 291 5 18964 
GO:CC secretory IgA GO:0071751 2.34E-07 6.63150015 5 291 5 18964 
immunoglobulin complex 
GO:CC polymeric IgA GO:0071749 2.34E-07 6.63150015 5 291 5 18964 
immunoglobulin complex 
GO:CC intermediate filament GO:0005882 3.71E-07 6.43017726 219 291 19 18964 
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GO:GG | cytoplasm GO:0005737 | 1.3253Е-06 | 5.87769294 11951 | 291 229 18964 
GO:CC | phagocytic vesicle GO:0045335 | 8.7887Е-06 | 5.05607765 139 291 14 18964 
GO:CC | fibrinogen complex GO:0005577 | 1.2595E-05 | 4.89978918 8 291 5 18964 
GO:CC | cortical cytoskeleton GO:0030863 | 1.2708Е-05 | 4.89592901 121 291 13 18964 
GO:CC | cell leading edge GO:0031252 | L3645E-05 | 4.86503857 426 291 24 18964 
GO:CC | monomeric IgA GO:0071748 | 1.5433Е-05 | 4.81154372 4 291 4 18964 
immunoglobulin complex 
GO:CC | supramolecular complex GO:0099080 | 1.667Е-05 | 4.7780559 1325 | 291 47 18964 
GO:CC | desmosome GO:0030057 | 2.009E-05 | 4.69702735 25 291 7 18964 
GO:CC | plasma membrane GO:0005886 | 6.7254Е-05 | 4.17227899 5680 | 291 128 18964 
GO:CC | cell junction GO:0030054 | 8.8728E-05 | 4.05194049 2107 |291 62 18964 
GO:CC | lamellipodium GO:0030027 | 0.00016992 | 3.7697552 203 291 15 18964 
GO:CC | protein-containing complex GO:003299] | 0.00018814 | 372551029 5541 | 291 124 18964 
GO:CC | cell cortex GO:0005938 | 0.00021369 | 3.67020642 326 291 19 18964 
GO:CC | cortical actin cytoskeleton GO:0030864 | 0.00041917 | 3.37761358 92 291 10 18964 
GO:CC | actin filament GO:0005884 | 0.00058385 | 323370147 118 291 П 18964 
GO:CC | keratin filament GO:0045095 | 0.0007509 | 3.12441547 98 291 10 18964 
GO:CC | IgM immunoglobulin GO:0071754 | 0.00101607 | 2.99307497 3 291 3 18964 
complex, circulating 
GO:CC | IgM immunoglobulin GO:0071753 | 0.00101607 | 2.99307497 3 291 3 18964 
complex 
GO:CC | secretory dimeric IgA GO:0071752 | 0.00101607 | 2.99307497 3 291 3 18964 
immunoglobulin complex 
GO:CC | dimeric IgA immunoglobulin | GO:0071750 | 0.00101607 | 2.99307497 3 291 3 18964 
complex 
GO:CC | pentameric IgM GO:0071756 | 0.00101607 | 2.99307497 3 291 3 18964 
immunoglobulin complex 
GO:CC | podosome GO:0002102 | 0.0015341 | 281414641 30 291 6 18964 
GO:CC | cell-cell junction GO:0005911 | 0.0025887 | 2.58691768 496 291 22 18964 
GO:CC | lysosomal membrane GO:0005765 | 0.00349065 | 2.45709422 395 291 19 18964 
GO:CC | lytic vacuole membrane GO:0098852 | 0.00349065 | 2.45709422 395 291 19 18964 
GO:CC СОР signalosome GO:0008180 | 0.00393191 | 2.40539635 35 291 6 18964 
GO:CC | membrane microdomain GO:0098857 | 0.00593442 | 2.22662149 339 291 17 18964 
СО:СС | membrane raft GO:0045121 | 0.00593442 | 2.22662149 339 291 17 18964 
GO:CC | vacuolar membrane GO:0005774 | 0.00646691 | 2.18930291 450 291 20 18964 
GO:CC | phagocytic vesicle lumen GO:0097013 | 0.00993064 | 2.00302292 5 291 3 18964 
GO:CC | high-density lipoprotein GO:0034364 | 0.02050173 | 1.6882094 29 291 5 18964 
particle 
GO:CC | ruffle GO:0001726 | 0.03111583 | 1.50701855 180 291 П 18964 
GO:CC | proton-transporting two-sector | GO:0033178 | 0.03136481 | 1.50355731 17 291 4 18964 
ATPase complex, catalytic 
domain 
GO:CC | phagocytic vesicle membrane | GO:0030670 | 0.04309801 | 136554275 75 291 7 18964 
GO:CC | extracellular exosome GO:0070062 | 3.43E-166 | 165.464669 2178 | 291 235 18964 
GO:CC | extracellular vesicle GO:1903561 | 3.48E-162 | 161.458778 2263 | 291 235 18964 
GO:CC | extracellular membrane- GO:0065010 | 4.30Е-162 | 161.366514 2265 | 291 235 18964 
bounded organelle 
GO:CC | extracellular organelle GO:0043230 | 4.30Е-162 | 161.366514 2265 | 291 235 18964 
GO:CC | extracellular space GO:0005615 | 8.67E-157 | 156.061801 3595 | 291 264 18964 
СО:СС | extracellular region GO:0005576 | 2.51Е-145 | 144.600098 4564 | 291 274 18964 
GO:CC | vesicle GO:0031982 | 2.30Е-126 | 125.638439 4058 | 291 253 18964 
GO:CC | secretory granule lumen GO:0034774 | 8.91E-79 78.0501465 320 291 84 18964 
GO:CC | cytoplasmic vesicle lumen GO:0060205 | 2.85Е-78 77.5451298 324 291 84 18964 
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GO:CC vesicle lumen GO:0031983 5.06E-78 77.2954413 326 291 84 18964 
GO:CC secretory granule GO:0030141 2.80E-72 71.5534479 857 291 112 18964 
GO:CC secretory vesicle GO:0099503 1.22Е-63 62.9139476 1027 291 112 18964 
GO:CC cytoplasmic vesicle GO:0031410 1.80E-49 48.7437679 2433 291 143 18964 
GO:CC intracellular vesicle GO:0097708 2.33E-49 48.6317521 2438 291 143 18964 
GO:CC blood microparticle GO:0072562 5.23E-42 41.2811424 139 291 43 18964 
GO:CC ficolin-1-rich granule lumen GO:1904813 3.16E-38 37.4999676 124 291 39 18964 
GO:CC ficolin-1-rich granule GO:0101002 7.98Е-35 34.0978114 184 291 42 18964 
GO:CC azurophil granule lumen GO:0035578 2.90E-22 21.5371026 91 291 25 18964 
GO:CC endomembrane system GO:0012505 3.76E-22 21.4242576 4651 291 152 18964 
GO:CC primary lysosome GO:0005766 5.82E-21 20.2350713 154 291 29 18964 
GO:CC azurophil granule GO:0042582 5.82E-21 20.2350713 154 291 29 18964 
GO:CC collagen-containing GO:0062023 1.04E-20 19.9839159 421 291 43 18964 
extracellular matrix 
GO:CC immunoglobulin complex, GO:0019814 4.53E-20 19.3439023 165 291 29 18964 
GO:CC vacuolar lumen GO:0005775 2.16E-19 18.6655233 174 291 29 18964 
GO:CC extracellular matrix GO:0031012 2.70E-18 17.5682762 562 291 46 18964 
GO:CC external encapsulating GO:0030312 2.90E-18 17.5369303 563 291 46 18964 
structure 
GO:CC focal adhesion GO:0005925 5.10E-18 17.2928492 420 291 40 18964 
GO:CC cell-substrate junction GO:0030055 9.29E-18 17.0318551 427 291 40 18964 
GO:CC tertiary granule lumen GO:1904724 3.23E-17 16.4910719 55 291 18 18964 
GO:CC membrane-bounded organelle GO:0043227 1.46E-16 15.8358633 13201 291 265 18964 
GO:CC anchoring junction GO:0070161 6.35E-15 14.1974039 838 291 51 18964 
GO:CC lysosome GO:0005764 1.06E-14 13.9753658 723 291 47 18964 
GO:CC lytic vacuole GO:0000323 1.06E-14 13.9753658 723 291 47 18964 
GO:CC melanosome GO:0042470 8.17E-14 13.0879763 109 291 20 18964 
GO:CC pigment granule GO:0048770 8.17E-14 13.0879763 109 291 20 18964 
GO:CC cell surface GO:0009986 1.00E-13 12.9984142 896 291 51 18964 
GO:CC tertiary granule GO:0070820 1.89Е-13 12.722488 163 291 23 18964 
GO:CC specific granule lumen GO:0035580 1.94E-13 12.7124678 61 291 16 18964 
GO:CC vacuole GO:0005773 2.62Е-13 12.5816446 818 291 48 18964 
GO:CC cell periphery GO:0071944 3.47E-12 11.4599687 6178 291 158 18964 
GO:CC immunoglobulin complex, GO:0042571 6.74E-12 11.1711964 75 291 16 18964 
circulating 
GO:CC organelle GO:0043226 8.46E-12 11.0726524 14027 | 291 266 18964 
GO:CC side of membrane GO:0098552 2.16E-11 10.6653458 622 291 39 18964 
GO:CC specific granule GO:0042581 1.28E-10 9.89430479 158 291 20 18964 
GO:CC actin cytoskeleton GO:0015629 2.59E-10 9.58694898 517 291 34 18964 
GO:CC platelet alpha granule lumen GO:0031093 3.16E-10 9.50022942 66 291 14 18964 
GO:CC endoplasmic reticulum lumen GO:0005788 8.18E-10 9.08740858 313 291 26 18964 
GO:CC platelet alpha granule GO:0031091 1.99E-09 8.70096483 90 291 15 18964 
GO:CC membrane-enclosed lumen GO:0031974 3.02E-09 8.52057389 5579 291 140 18964 
GO:CC intracellular organelle lumen GO:0070013 3.02E-09 8.52057389 5579 291 140 18964 
GO:CC organelle lumen GO:0043233 3.02E-09 8.52057389 5579 291 140 18964 
GO:CC cytoskeleton GO:0005856 3.83E-09 8.41664118 2339 291 78 18964 
GO:CC endocytic vesicle lumen GO:0071682 4.95E-09 8.30565005 22 291 9 18964 
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GO:CC external side of plasma GO:0009897 1.18E-08 7.9282247 409 291 28 18964 
membrane 
GO:CC cornified envelope GO:0001533 1.28E-08 7.89327475 44 291 п 18964 
GO:CC Arp2/3 protein complex GO:0005885 1.66E-08 7.78030014 11 291 7 18964 
GO:CC cytosol GO:0005829 1.82E-08 7.74080428 5361 291 134 18964 
GO:CC polymeric cytoskeletal fiber GO:0099513 5.10E-08 7.29270151 766 291 38 18964 
GO:CC supramolecular fiber GO:0099512 6.89E-08 7.1620648 997 291 44 18964 
GO:CC supramolecular polymer GO:0099081 8.86E-08 7.05250542 1005 291 44 18964 
GO:CC endocytic vesicle GO:0030139 1.24E-07 6.9065204 335 291 24 18964 
GO:CC intermediate filament GO:0045111 1.83E-07 6.73692411 260 291 21 18964 
cytoskeleton 
GO:CC IgA immunoglobulin GO:0071746 2.34E-07 6.63150015 5 291 5 18964 
complex, circulating 
GO:CC IgA immunoglobulin complex | GO:0071745 2.34E-07 6.63150015 8 291 5 18964 
GO:CC secretory IgA GO:0071751 2.34E-07 6.63150015 5 291 5 18964 
immunoglobulin complex 
GO:CC polymeric IgA GO:0071749 2.34E-07 6.63150015 5 291 5 18964 
immunoglobulin complex 
GO:CC intermediate filament GO:0005882 3.71E-07 6.43017726 219 291 19 18964 
GO:CC cytoplasm GO:0005737 1.3253E-06 5.87769294 11951 291 229 18964 
GO:CC phagocytic vesicle GO:0045335 8.7887E-06 5.05607765 139 291 14 18964 
GO:CC fibrinogen complex GO:0005577 1.2595E-05 4.89978918 8 291 5 18964 
GO:CC cortical cytoskeleton GO:0030863 1.2708E-05 4.89592901 121 291 13 18964 
GO:CC cell leading edge GO:0031252 1.3645E-05 4.86503857 426 291 24 18964 
GO:CC monomeric IgA. GO:0071748 1.5433E-05 4.81154372 4 291 4 18964 
immunoglobulin complex 
GO:CC supramolecular complex GO:0099080 1.667E-05 4.7780559 1325 291 47 18964 
GO:CC desmosome GO:0030057 2.009E-05 4.69702735 25 291 ri 18964 
GO:CC plasma membrane GO:0005886 6.7254E-05 4.17227899 5680 291 128 18964 
GO:CC cell junction GO:0030054 8.8728E-05 4.05194049 2107 291 62 18964 
GO:CC lamellipodium GO:0030027 0.00016992 3.7697552 203 291 15 18964 
GO:CC protein-containing complex GO:0032991 0.00018814 3.72551029 5541 291 124 18964 
GO:CC cell cortex GO:0005938 0.00021369 3.67020642 326 291 19 18964 
GO:CC cortical actin cytoskeleton GO:0030864 0.00041917 3.37761358 92 291 10 18964 
GO:CC actin filament GO:0005884 0.00058385 3.23370147 118 291 п 18964 
GO:CC keratin filament GO:0045095 0.0007509 3.12441547 98 291 10 18964 
GO:CC IgM immunoglobulin GO:0071754 0.00101607 2.99307497 3 291 3 18964 
complex, circulating 
GO:CC IgM immunoglobulin GO:0071753 0.00101607 2.99307497 3 291 3 18964 
complex 
GO:CC secretory dimeric IgA GO:0071752 0.00101607 2.99307497 3 291 3 18964 
immunoglobulin complex 
GO:CC dimeric IgA immunoglobulin GO:0071750 0.00101607 2.99307497 4 291 3 18964 
complex 
GO:CC pentameric IgM GO:0071756 0.00101607 2.99307497 3 291 3 18964 
immunoglobulin complex 
GO:CC podosome GO:0002102 0.0015341 2.81414641 30 291 6 18964 
GO:CC cell-cell junction GO:0005911 0.0025887 2.58691768 496 291 22 18964 
GO:CC lysosomal membrane GO:0005765 0.00349065 2.45709422 395 291 19 18964 
GO:CC lytic vacuole membrane GO:0098852 0.00349065 2.45709422 395 291 19 18964 
GO:CC СОР9 signalosome GO:0008180 0.00393191 2.40539635 35 291 6 18964 
GO:CC membrane microdomain GO:0098857 0.00593442 2.22662149 220 291 17 18964 
GO:CC membrane raft GO:0045121 0.00593442 2.22662149 339 291 17 18964 
GO:CC vacuolar membrane GO:0005774 0.00646691 2.18930291 450 291 20 18964 
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GO:CC phagocytic vesicle lumen GO:0097013 0.00993064 2.00302292 5 291 3 18964 

GO:CC high-density lipoprotein GO:0034364 0.02050173 1.6882094 29 291 5 18964 
particle 

GO:CC ruffle GO:0001726 0.03111583 1.50701855 180 291 11 18964 

GO:CC proton-transporting two-sector | GO:0033178 0.03136481 1.50355731 17 291 4 18964 
AT omplex, catalytic 
domain 

GO:CC phagocytic vesicle membrane GO:0030670 0.04309801 1.36554275 75 291 7 18964 

KEGG Salivary secretion KEGG:04970 | 4.88E-12 11.3116881 91 175 19 8000 

KEGG Salmonella infection KEGG:05132 1.9526E-06 5.70939441 249 175 22 8000 

KEGG Staphylococcus aureus KEGG:05150 | 4.0995E-06 5.38727309 87 175 13 8000 
infection 

KEGG Carbon metabolism KEGG:01200 1.9318E-05 4.71403767 116 175 14 8000 

KEGG Estrogen signaling pathway KEGG:04915 | 2.5575E-05 4.59219099 137 175 15 8000 

КЕСС Complement and coagulation KEGG:04610 | 2.6347E-05 4.57926592 85 175 12 8000 
cascades 

KEGG Regulation of actin KEGG:04810 | 9.8597E-05 4.00613481 216 175 18 8000 
cytoskeleton 

KEGG Pathogenic Escherichia coli KEGG:05130 | 0.00011486 3.93984487 196 175 17 8000 
infection 

KEGG Glycolysis / Gluconeogenesis KEGG:00010 | 0.0001596 3.79696132 67 175 10 8000 

КЕСС Phagosome KEGG:04145 | 0.00035236 3.45301673 147 175 14 8000 

КЕСС Tight junction KEGG:04530 | 0.0003785 3.42193447 169 175 15 8000 

KEGG Biosynthesis of amino acids KEGG:01230 | 0.00045868 3.33848606 75 175 10 8000 

КЕСС Pentose phosphate pathway KEGG:00030 | 0.0037617 2.42461576 30 175 6 8000 

KEGG Bacterial invasion of epithelial | KEGG:05100 | 0.00408906 2.38837694 77 175 9 8000 
cells 

KEGG Fc gamma R-mediated KEGG:04666 | 0.0042006 2.37668861 96 175 10 8000 
phagocytosis 

KEGG Biosynthesis of amino acids KEGG:01230 | 0.00045868 3.33848606 75 175 10 8000 

КЕСС Pentose phosphate pathway KEGG:00030 | 0.0037617 2.42461576 30 175 6 8000 

КЕСС Bacterial invasion of epithelial | KEGG:05100 | 0.00408906 2.38837694 77 175 9 8000 
cells 

KEGG Fc gamma R-mediated KEGG:04666 | 0.0042006 2.37668861 96 175 10 8000 
phagocytosis 

KEGG Neutrophil extracellular trap KEGG:04613 | 0.00618118 2.2089288 189 175 14 8000 
formation 

KEGG Shigellosis KEGG:05131 | 0.02990563 1.52424697 246 175 15 8000 


Table 8 14.GO analysis of breath exosomal proteome with proteins associated with “regulated exocytosis” removed (184 
proteins). The analysis was performed by using g:Profiler software. 
*The list of proteins associated with each ontology is available on request deanna.shea@vuw.ac.nz 


Source Term name Term ID Adjusted p- | Negative log10 Term Query |Intersec Effective 
value of adjusted size size tion domain 
p-value size size 
GO:MF antigen binding GO:0003823 5.7463610345 | 22.24060709146 | 163 182 27 18694 
7402Е-23 6700 
СО:МЕ immunoglobulin receptor GO:0034987 2.7552817540 | 13.55983398375 | 80 182 16 18694 
binding 6458E-14 2700 
GO:MF structural constituent of GO:0005200 1.0301079689 | 12.98711725310 | 104 182 17 18694 
cytoskeleton 2422E-13 980 
GO:MF structural molecule activity GO:0005198 9.9793182410 | 7.000899127476 | 698 182 28 18694 
6046E-08 060 
GO:MF endopeptidase inhibitor activity | GO:0004866 1.2760220960 | 6.894141805163 |179 182 15 18694 
1989E-07 110 
GO:MF peptidase inhibitor activity GO:0030414 1.8807210072 | 6.725675624445 | 184 182 15 18694 
174E-07 660 
GO:MF endopeptidase regulator activity | GO:0061135 2.5429137994 | 6.594668361441 | 188 182 15 18694 
1407E-07 800 
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GO:MF  |enzyme inhibitor activity GO:0004857 | 6.6780889566 | 6.175347800147 | 386 182 20 18694 
5373Е-07 400 
СО:МЕ | peptidase regulator activity GO:0061134 | 0.0000030336 | 5.518030708462 | 225 182 15 18694 
76668828510 |410 
GO:MF | cadherin binding GO:0045296 | 0.0000147587 | 4.830950201831 | 332 182 17 18694 
57541583500 | 8300 
GO:MF | cell adhesion molecule binding | GO:0050839 | 0.0000490991 | 4.308925935662 | 548 182 21 18694 
6024458210 |430 
GO:MF | cysteine-type endopeptidase |GO:0004869 | 0.0005318295 | 3.274227551690 | 55 182 7 18694 
inhibitor activity 303777910 |3100 
GO:MF | actin filament binding GO:0051015 | 0.0005553348 | 3.255445095270 | 210 182 12 18694 
196708230 |790 
GO:MF | actin binding GO:0003779 | 0.0040473431 | 2.392829975357 | 445 182 16 18694 
263316500 |500 
СО:МЕ | protein-containing complex GO:0044877 | 0.0093700411 | 2.028258501405 | 1282 |182 29 18694 
binding 50286080 | 5400 
GO:MF | enzyme regulator activity GO:0030234 | 0.0115557489 | 1.937201900711 | 1228 |182 28 18694 
795888 8400 
СО:МЕ | signaling receptor binding GO:0005102 — | 0.0167239437 | 1.776661303070 | 1681 |182 34 18694 
02746500 1800 
GO:MF | proton-transporting ATP GO:0046933 | 0.0291756370 | 1.534979652669 | 22 182 4 18694 
synthase activity, rotational 29161600 |7600 
mechanism 
GO:MF | protease binding GO:0002020 | 0.0346488681 | 1.460310948074 | 139 182 8 18694 
12971900 |3800 
GO:BP | humoral immune response GO:0006959 | 1.5756103388 | 29.80255117810 | 384 183 44 18092 
3263E-30 |2000 
GO:BP | humoral immune response GO:0002455 | 6.5459348347 | 23.18402832243 | 155 183 28 18092 
mediated Бу circulating 4649-24 |9400 
immunoglobulin 
GO:BP | complement activation, GO:0006958 | 1.3535656363 | 22.86852067981 | 142 183 27 18092 
classical pathway 6743E-23 1400 
GO:BP | complement activation GO:0006956 | 2.3360536997 | 22.63151717814 | 180 183 29 18092 
9412E-23 5200 
GO:BP | immunoglobulin mediated GO:0016064 | 2.2102243287 | 19.65556364496 | 227 183 29 18092 
immune response 455Е-20 100 
GO:BP |В cell mediated immunity GO:0019724 | 3.2357736141 | 19.49002187072 | 230 183 29 18092 
811E-20 8200 
СО:ВР  |phagocytosis GO:0006909 | 7.0101502517 | 18.15427267350 | 390 183 34 18092 
4999-19 170 
GO:BP activation of immune response | GO:0002253 | 2.1343487065 | 17.67073462475 | 566 183 39 18092 
6997Е-18 8200 
GO:BP | immune response-activating cell | GO:0002429 | 4.7459765265 | 17.32367441412 | 477 183 36 18092 
surface receptor signaling 2044E-18 2900 
pathway 
GO:BP | immune response-activating — |GO:0002757 | 4.7459765265 | 17.32367441412 | 477 183 36 18092 
signal transduction 2044E-18 2900 
GO:BP | immune response-regulating | GO:0002433 | 3.2352984419 | 16.49008565141 | 135 183 22 18092 
cell surface receptor signaling 0179E-17 17 
pathway involved in 
phagocytosis 
GO:BP | Fc-gamma receptor signaling | GO:0038096 | 3.2352984419 | 16.49008565141 | 135 183 22 18092 
pathway involved іп 0179E-17 17 
phagocytosis 
GO:BP | immune response-regulating | GO:0002768 | 4.7084313763 | 16.32712375487 | 511 183 36 18092 
cell surface receptor signaling 5417E-17 9600 
pathway 
GO:BP | immune response-regulating | GO:0002764 | 5.7157380178 | 16.24292768541 | 514 183 36 18092 
signaling pathway 557E-17 070 
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GO:BP |Fc-gamma receptor signaling | СО:0038094 | 6.2790074625 | 16.20210900078 | 139 183 22 18092 
pathway 4861E-17 590 
GO:BP | Fc receptor mediated GO:0002431 | 8.6764746065 | 16.06165669993 | 141 183 22 18092 
stimulatory signaling pathway 3706Е-17 |090 
СО:ВР | lymphocyte mediated immunity | GO:0002449 |1.2626219785 | 15.89872665515 | 365 183 31 18092 
1614Е-16 |980 
GO:BP | positive regulation of immune | GO:0050778 — |1.3540484041 | 15.86836581033 | 753 183 42 18092 
response 4269E-16 | 6000 
GO:BP | adaptive immune response GO:0002460 | 18806887962 | 15.72568306263 | 370 183 31 18092 
based on somatic recombination 7614E-16 |2300 
of immune receptors built from 
immunoglobulin superfamily 
domains 
GO:BP | regulation of complement GO:0030449 | 4.7529057496 | 14.32304079770 | 110 183 19 18092 
activation 2451E-15 |2700 
GO:BP | regulation of humoral immune | GO:0002920 | 7.4393349799 | 14.12846588534 | 131 183 20 18092 
response 9882E-15 |4600 
GO:BP | endocytosis GO:0006897 | 2.7704813253 | 13.55744477288 | 701 183 38 18092 
8607E-14 5100 
GO:BP | defense response to bacterium | GO:0042742 | 7.0537082041 | 13.15158251006 | 356 183 28 18092 
7298E-14 |950 
GO:BP | immune response GO:0006955 | 1.0853082403 | 12.96444689952 | 2326 |183 68 18092 
6957Е-13 9800 
GO:BP  |regulation of immune response | GO:0050776 | 1.1493495305 | 12.93954787724 | 998 183 4 18092 
0126Е-13 4100 
GO:BP | Fc receptor signaling pathway | GO:0038093 | 8.0030288127 | 12.09674561954 | 240 183 23 18092 
2537E-13 4200 
GO:BP | antimicrobial humoral response | GO:0019730 | 8.8777430682 | 12.05169742809 | 144 183 19 18092 
4499E-13 3900 
GO:BP | positive regulation of immune | СО:0002684 | 1.4843042141 | 11.82847707951 | 1070 — [183 4 18092 
system process 8775Е-12 |3200 
GO:BP | сотійсабоп GO:0070268 | 3.0321187673 | 11.51825379149 | 112 183 17 18092 
3889E-12 |9800 
СО:ВР | adaptive immune response GO:0002250 | 6.8918735724 | 11.16166269804 | 698 183 35 18092 
3622E-12 |0200 
GO:BP | receptor-mediated endocytosis | GO:0006898 | 11872272198 | 10.92546615476 | 333 183 25 18092 
6454Е-11 9400 
СО:ВР | immune system process GO:0002376 | 1.3941071515 | 10.85570384492 |3322 |183 79 18092 
7804Е-11 4800 
GO:BP | phagocytosis, recognition GO:0006910 | 1.8110554759 | 9.742068246250 | 100 183 15 18092 
326Е-10 050 
GO:BP | regulation оГішпише system | СО:0002682 | 2.3761889831 | 9.624119021998 | 1625 |183 51 18092 
process 1763E-10 |380 
GO:BP | immune effector process GO:0002252 | 4.3920093903 | 9.357336740011 | 1310 |183 45 18092 
6124Е-10 130 
GO:BP | keratinization GO:0031424 | 3.0913581053 | 8.509850683011 | 224 183 19 18092 
2801E-09 890 
СО:ВР | defense response to other GO:0098542 | 3.3399841279 | 8.476255597010 | 1222 |183 42 18092 
organism 376E-09 44 
GO:BP involved in | GO:0044419 | 5.4778639180 | 8.261388760758 |2272 |183 59 18092 
tion between 662E-09 040 
СО:ВР | response to biotic stimulus GO:0009607 | 5.8345955027 | 8.233989247075 |1645 |183 49 18092 
5456E-09 |490 
GO:BP | phagocytosis, engulfment GO:0006911 | 6.5834131107 | 8.181548891968 | 127 183 15 18092 
792Е-09 13 
СО:ВР | response to other organism GO:0051707 | 1.0137046165 | 7.994088575648 | 1610 |183 48 18092 


8696Е-08 


410 
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СО:ВР | response to external biotic GO:0043207 | 10598505049 | 7.974755388945 | 1612 |183 48 18092 
stimulus 2916E-08 |650 
СО:ВР | keratinocyte differentiation GO:0030216 | 1.1635581833 | 7.934211895081 | 306 183 21 18092 
432E-08 930 
GO:BP | plasma membrane invagination | GO:0099024 | 18025299116 | 7.744117519749 | 136 183 15 18092 
6474E-08 |640 
GO:BP |В cell receptor signaling GO:0050853 | 1.8025299116 | 7.744117519749 | 136 183 15 18092 
pathway 6474E-08 |640 
СО:ВР | membrane invagination GO:0010324 | 4.1476192770 | 7.382201115704 | 144 183 15 18092 
3297E-08 |480 
СО:ВР | retina homeostasis GO:0001895 | 5.1512173105 | 7.288090128477 |79 183 12 18092 
3986E-08 |490 
GO:BP | positive regulation of B cell |GO:0050871 | 9.0655481281 | 7.042605932126 | 152 183 15 18092 
activation 6144E-08 750 
GO:BP | regulation of immune effector | GO:0002697 | 9.6975856104 | 7.013336377753 | 458 183 24 18092 
process 1552E-08 |490 
GO:BP | leukocyte mediated immunity | GO:0002443 | 1.8555984657 | 6.731515995239 | 891 183 33 18092 
4222E-07 |270 
GO:BP | immunoglobulin production | GO:0002377 | 3.0024669123 | 6.522521769897 | 194 183 16 18092 
0236Е-07 |970 
GO:BP | epidermal cell differentiation | СО:0009913 | 3.6898015705 | 6.432996988616 | 368 183 21 18092 
7271E-07 | 6800 
СО:ВР | response to bacterium GO:0009617 | 4.6797532250 | 6.329777047744 | 771 183 30 18092 
3965E-07 |090 
GO:BP | defense response GO:0006952 | 9.0030016933 | 6.045612668171 | 1831 |183 48 18092 
28E-07 280 
СО:ВР | negative regulation of GO:0010951 | 0.0000025671 | 5.590550233215 | 258 183 17 18092 
endopeptidase activity 41259319690 | 910 
0 
GO:BP | Fc-epsilon receptor signaling | GO:0038095 | 00000036168 | 5.441674546147 | 167 183 14 18092 
pathway 07993430330 | 7100 
0 
GO:BP | regulation of B cell activation | GO:0050864 | 0.0000037877 | 5.421624284442 | 198 183 15 18092 
01235057710 | 510 
GO:BP | negative regulation of peptidase | GO:0010466 | 0.0000054250 | 5.265595993848 |271 183 17 18092 
activity 53259505020 | 910 
0 
GO:BP | skin development GO:0043588 | 0.0000061633 | 5.210180211213 | 430 183 21 18092 
91978219760 |910 
0 
СО:ВР | cell recognition GO:0008037 | 0.0000064397 | 5.191131354893 | 239 183 16 18092 
44622325650 | 900 
GO:BP | response to stress GO:0006950 | 0.0000092050 | 5.035975367343 | 4199 |183 78 18092 
17800183040 |15 
GO:BP | innate immune response GO:0045087 | 0.0000134717 | 4.870576040728 | 999 183 32 18092 
48275181000 | 7200 
СО:ВР | pentose biosynthetic proc GO:0019322 | 0.0000230260 | 4.637781397124 | 4 183 4 18092 
05437108200 | 400 
СО:ВР | membrane organization GO:0061024 | 0.0000242954 | 4.614475571509 | 968 183 31 18092 
2096709870 | 220 
СО:ВР | production of molecular GO:0002440 | 0.0000288816 | 4.539377278471 | 303 183 17 18092 
mediator of immune response 97935607100 | 210 
GO:BP | leukocyte migration GO:0050900 | 0.0000383326 | 4.416430878639 | 523 183 2 18092 
7449917820 | 880 
СО:ВР | epidermis development GO:0008544 | 0.0000506094 | 4.295768745223 | 485 183 21 18092 
07716544800 | 150 
GO:BP positive regulation of GO:0051251 | 0.0000808140 | 4.092513058696 | 366 183 18 18092 
lymphocyte activation 6289598590 |940 
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СО:ВР negative regulation of GO:0045861 0.0000808140 | 4.092513058696 | 366 183 18 18092 
6289598590 |940 
GO:BP ceptor-mediated GO:0050851 0.0000809166 | 4.091962291259 | 325 183 17 18092 
signaling pathway 1540707030 |290 
СО:ВР vesicle-mediated transport GO:0016192 0.0001111979 | 3.953903189361 | 2193 183 49 18092 
5762856700 | 780 
GO:BP epithelial cell differentiation GO:0030855 0.0001291644 | 3.888857159801 | 817 183 27 18092 
0276893800 | 780 
СО:ВР В cell activation GO:0042113 0.0001370020 | 3.863272804319 | 337 183 17 18092 
9101154300 | 1800 
СО:ВР positive regulation of response | GO:0048584 0.0001920876 | 3.716500554445 | 2374 183 51 18092 
to stimulus 509448580 410 
СО:ВР regulation of endopeptidase GO:0052548 0.0002319863 | 3.634537530185 | 437 183 19 18092 
activity 7025293100 |930 
GO:BP tissue homeostasis GO:0001894 0.0002693516 | 3.S69680351675 | 272 183 15 18092 
549762930 990 
СО:ВР detection of chemical stimulus | GO:0001580 0.0003086800 | 3.510491374269 | 40 183 7 18092 
involved in sensory perception 9535022000 | 990 
of bitter taste 
GO:BP cell surface receptor signaling | GO:0007166 0.0003341092 | 3.476111466144 | 3242 183 62 18092 
pathway 7643926400 | 400 
GO:BP antibacterial humoral response | GO:0019731 0.0004532613 | 3.343651328168 | 62 183 8 18092 
331682490 140 
GO:BP positive regulation of leukocyte | GO:0002696 0.0005312732 | 3.274682041924 | 415 183 18 18092 
activation 605973440 360 
GO:BP sensory perception of bitter GO:0050913 0.0006132007 | 3.212397350128 | 44 183 H 18092 
taste 111825400 050 
GO:BP response to external stimulus GO:0009605 0.0006465111 | 3.189423959017 | 2991 183 58 18092 
82270932 8300 
GO:BP regulation of peptidase activity | GO:0052547 0.0006899666 | 3.161171929133 | 469 183 19 18092 
047802230 2600 
GO:BP establishment of localization GO:0051234 0.0007120255 | 3.147504444866 | 5398 183 87 18092 
126063500 4400 
GO:BP positive regulation of cell GO:0050867 0.0008074031 | 3.092909534011 | 427 183 18 18092 
activation 993702880 5200 
GO:BP sensory perception of taste GO:0050909 0.0008375359 | 3.076996534315 | 67 183 8 18092 
656655340 3500 
GO:BP | detection of chemical stimulus | GO:0050912 | 0.0008420199 | 3.074677601435 | 46 183 7 18092 
involved in sensory perception 833329030 |95 
of taste 
GO:BP regulation of lymphocyte GO:0051249 0.0008529041 | 3.069099778417 | 524 183 20 18092 
activation 382336410 |1000 
GO:BP response to stimulus GO:0050896 0.0008541684 | 3.068456485711 | 9469 183 129 18092 
26837562 4 
GO:BP cell motility GO:0048870 0.0009571955 | 3.018999343310 | 1856 183 42 18092 
186523500 9100 
GO:BP localization of cell GO:0051674 0.0009571955 | 3.018999343310 | 1856 183 42 18092 
186523500 9100 
GO:BP transport GO:0006810 0.0009894328 | 3.004613675799 | 5259 183 85 18092 
477033220 720 
GO:BP killing of cells of other GO:0031640 0.0010553713 | 2.976594690466 | 69 183 8 18092 
organism 724943600 22 
GO:BP intermediate filament GO:0045104 0.0017453551 | 2.758116194318 | 51 183 7 18092 
cytoskeleton organization 253858200 5700 
GO:BP intermediate filament-based GO:0045103 0.0019997503 | 2.699024211864 | 52 183 Ж 18092 
process 80685540 320 
GO:BP cytoskeleton organization GO:0007010 0.0020012024 | 2.698708978904 | 1435 183 35 18092 
27849250 3100 
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СО:ВР | movement of cell or subcellular | GO:0006928 | 0.0020068757 | 2.697479515777 |2337 |183 48 18092 
component 45422200 |5900 
GO:BP_ | locomotion GO:0040011 | 0.0021755007 | 2.662440772377 |2055 |183 44 18092 
012690000 |150 
СО:ВР | peptide cross-linking GO:0018149 | 0.0028523063 | 2.544803835897 | 35 183 6 18092 
192504900 |1300 
GO:BP | negative regulation of hydrolase | GO:0051346 | 0.0037425186 | 2.426836030108 | 475 183 18 18092 
activity 21231980 |9800 
GO:BP | anatomi GO:0060249 | 0.0047382032 | 2.324386310912 | 483 183 18 18092 
homeos 80567640 |7500 
GO:BP regulation of cell activation GO:0050865 | 0.0091625445 | 2.037983901714 | 666 183 21 18092 
29118860 |6800 
GO:BP |Iymphocyte activation GO:0046649 | 0.0100411904 | 1.998214794910 | 783 183 23 18092 
66760300 | 5600 
GO:BP | intermediate filament GO:0045109 | 0.0102712683 | 1.988375924585 | 25 183 5 18092 
organization 39334600 |4600 
GO:BP | regulation of response to GO:0048583 | 0.0106988069 | 1.970664649130 | 4324 |183 71 18092 
stimulus 43073700 |5100 
GO:BP | regulation of leukocyte GO:0002694 | 00115419087 | 1.937722362901 | 621 183 20 18092 
activation 6680960 0100 
GO:BP | proteolysis GO:0006508 | 0.0146145876 | 1.835213433912 | 1846 |183 39 18092 
4402970 3600 
GO:BP positive regulation of biological | GO:0048518 | 0.0226356866 | 1.645206327315 | 6466 |183 94 18092 
pro 1864640 550 
GO:BP | cell migration GO:0016477 | 0.0277745409 | 1.556353110791 | 1684 |183 36 18092 
13944800 |7100 
СО:ВР | pentose metabolic process GO:0019321 | 0.0288028219 | 1.540564960922 | 15 183 4 18092 
05524200 |69 
GO:BP epithelium development GO:0060429 | 0.0369356145 | 1.432554670904 | 1361 — |183 31 18092 
2509410 880 
GO:BP | pentose-phosphate shunt GO:0006098 | 0.0381006946 | 1.419067105821 | 16 183 4 18092 
8467650 2000 
GO:BP tissue development GO:0009888 | 0.0464635608 | 1.332887509890 | 2161 |183 42 18092 
64673600 | 5400 
GO:CC | extracellular space GO:0005615 | 2.0497506352 | 93.68829897031 3508 |188 167 18963 
7921E-94 |470 
GO:CC | extracellular exosome GO:0070062 | 2.9385790556 | 93.53186262110 |2180 |188 144 18963 
6614E-94 |490 
GO:CC | extracellular vesicle GO:1903561 | 7.4420323209 | 91.12830844815 | 2266 |188 144 18963 
2277E-92 |920 
GO:CC | extracellular organelle GO:0043230 | 8.4412766487 | 91.07359186622 2268 |188 144 18963 
4414E-92 |080 
GO:CC | extracellular region GO:0005576 | 3.0579953528 | 86.51456317890 | 4602 |188 174 18963 
3917E-87 |990 
GO:CC | vesicle GO:0031982 | 8.2511192250 | 59.08348713747 | 4071 |188 147 18963 
1632E-60 |67 
GO:CC | immunoglobulin complex GO:0019814 | 8.4702320398 | 26.07210469212 | 167 188 30 18963 
3511E-27 |030 
GO:CC blood microparticle GO:0072562 | 3.5485845023 | 25.44994484846 | 141 188 28 18963 
7704Е-26 |5500 
GO:CC immunoglobulin complex, GO:0042571 | 4.0356986501 | 15.39408127045 |77 188 17 18963 
circulating 2468E-16 150 
GO:CC | pigment granule GO:0048770 | 7.5739444234 | 9.120677885993 | 105 188 4 18963 
9401E-10 |440 
GO:CC |melanosome GO:0042470 | 7.5739444234 | 9.120677885993 | 105 188 4 18963 
9401E-10 |440 
GO:CC | focal adhesion GO:0005925 | 9.8158643106 | 8.008071453698 | 418 188 23 18963 
4653E-09 |410 
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GO:CC | cell-substrate junction GO:0030055 | 1.4351244399 | 7.843110439540 | 426 188 23 18963 
115E-08 610 
GO:CC | cell surface GO:0009986 | 17874444835 | 6.747767438126 | 922 188 32 18963 
2665Е-07 860 
GO:CC side of membrane GO:0098552 | 0.0000012994 | 5.886251547707 | 632 188 25 18963 
16724860960 | 570 
GO:CC | intermediate filament GO:0005882 | 0.0000017161 | 5.765455137002 |219 188 15 18963 
08976514010 | 490 
0 
GO:CC | external side of plasma GO:0009897 | 0.0000024806 | 5.605427276218 | 420 188 20 18963 
membrane 91301362250 | 01 
0 
GO:CC | actin cytoskeleton GO:0015629 | 0.0000028472 | 5.545579586824 | 513 188 22 18963 
15984554060 | 900 
GO:CC | monomeric IgA GO:0071748 | 0.0000031194 | 5.505927884813 | 4 188 4 18963 
immunoglobulin complex 07522932680 | 340 
0 
СО:СС | intermediate filament GO:0045111 | 0.0000132404 | 4.878097956831 | 255 188 15 18963 
cytoskeleton 2858428850 |440 
GO:CC IgA immunoglobulin complex, | GO:0071746 | 0.0000154759 | 4.810342954205 | 5 188 4 18963 
circulating 40302733600 | 6000 
GO:CC IgA immunoglobulin complex | GO:0071745 | 0.0000154759 | 4.810342954205 |5 188 4 18963 
40302733600 | 6000 
GO:CC | secretory IgA immunoglobulin | GO:0071751 | 0.0000154759 | 4.810342954205 | 5 188 4 18963 
complex 40302733600 | 6000 
GO:CC polymeric IgA immunoglobulin | GO:0071749 | 0.0000154759 | 4.810342954205 | 5 188 4 18963 
complex 40302733600 | 6000 
GO:CC |membrane-bounded organelle | GO:0043227 | 0.0000323140 | 4490608300440 | 12836 |188 159 18963 
72837106000 | 8400 
GO:CC | Arp2/3 protein complex GO:0005885 | 0.0000365267 | 4.437389533159 | 13 188 5 18963 
0245045130 |410 
GO:CC | polymeric cytoskeletal fiber | GO:0099513 | 0.0000689749 | 4.161308298393 | 775 188 25 18963 
9880173190 | 560 
GO:CC | anchoring junction GO:0070161 | 0.0000916230 | 4.037995368755 | 843 188 26 18963 
2605982310 | 3000 
GO:CC | cytoskeleton GO:0005856 | 0.0001118182 | 3.951487175266 | 2302 |188 48 18963 
8440805500 | 6700 
GO:CC | keratin filament GO:0045095 | 0.0001717807 | 3.765025501076 | 97 188 9 18963 
517291480 | 0300 
GO:CC | supramolecular fiber GO:0099512 | 0.0002421224 | 3.615964853858 | 1006 | 188 28 18963 
981236110 |6800 
GO:CC | supramolecular polymer GO:0099081 | 0.0002832205 | 3.547875199317 | 1014 (188 28 18963 
753390020 | 000 
GO:CC  |IgM immunoglobulin complex, | GO:0071754 | 0.0003196971 | 3.495261280214 |3 188 3 18963 
circulating 16944885 |310 
GO:CC | secretory dimeric IgA GO:0071752 | 0.0003196971 | 3.495261280214 | 3 188 3 18963 
immunoglobulin complex 16944885 |310 
GO:CC | dimeric IgA immunoglobulin | GO:0071750 | 0.0003196971 | 3.495261280214 |3 188 3 18963 
complex 16944885 |310 
GO:CC | pentameric IgM GO:0071756 | 0.0003196971 | 3.495261280214 | 3 188 3 18963 
immunoglobulin complex 16944885 |310 
GO:CC  |IgM immunoglobulin complex | GO:0071753 | 0.0003196971 | 3.495261280214 |3 188 3 18963 
16944885 |310 
GO:CC | protein-containing complex GO:0032991 | 0.0008334698 | 3.079110094812 | 5556 | 188 85 18963 
709956400 | 3900 
GO:CC | comified envelope GO:0001533 | 0.0015081372 | 2.821559140234 | 44 188 6 18963 
253341500 |1600 


321 


GO:CC | endoplasmic reticulum lumen | GO:0005788 | 0.0041568279 | 2.381237953573 | 307 188 13 18963 
20143590 |870 
СО:СС | proton-transporting two-sec GO:0033178 | 0.0085603304 | 2.067509469490 | 18 188 4 18963 
АТРаве complex, catalytic 63079390 |880 
domain 
GO:CC | actin filament GO:0005884 | 0.0098495743 | 2.006582536381 | 122 188 8 18963 
67682090 | 6700 
GO:CC  |lamellipodium GO:0030027 | 0.0125771122 | 1.900419063138 | 204 188 10 18963 
46363400 |4100 
GO:CC | endocytic vesicle lumen GO:0071682 | 0.0133450730 | 1.874679043016 | 20 188 4 18963 
9781710 100 
GO:CC | сей periphery GO:0071944 | 0.0170789729 | 1.767538248668 | 5786 |188 83 18963 
7529270 8900 
GO:CC | supramolecular complex GO:0099080 | 0.0182428982 | 1.738906163200 | 1337 |188 29 18963 
89635100 |4200 
Go:cc | collagen-containing GO:0062023 | 0.0296235241 | 1.528363276856 | 424 188 14 18963 
extracellular matrix 6542730 1600 
GO:CC | cell leading edge GO:0031252 | 0.0335225041 | 1.474663546815 | 429 188 14 18963 
4778920 4500 
GO:CC | organelle GO:0043226 | 0.0391685757 | 1.407062219861 | 13974 | 188 160 18963 
6300100 7600 
KEGG | Salivary secretion KEGG:04970 | 1.0683987225 | 11.97126663992 | 93 98 16 7987 
4917E-12 260 
KEGG | Tight junction KEGG:04530 | 0.0004794556 | 3.319251553098 | 168 98 п 7987 
567297980 |300 
KEGG | Staphylococcus aureus infection | KEGG:05150 [0.0011481623 | 2.939996698554 | 91 98 8 7987 
496322400 |4900 
KEGG | Pathogenic Escherichia coli KEGG:05130 | 0.0020672069 | 2.684616045582 | 196 98 п 7987 
infection 404387200 |66 
KEGG | Regulation of actin cytoskeleton | KEGG:04810 | 0.0052675256 | 2.278393342297 | 217 98 п 7987 
33485350 1500 
KEGG | Salmonella infection KEGG:05132 | 0.0178952753 | 1.747261615047 | 249 98 п 7987 
41643600 |8500 
KEGG | Bacterial invasion of epithelial | KEGG:05100 [0.0278434844 | 1.555276416572 | 77 98 6 7987 
cells 28300300 |55 
KEGG | Pentose phosphate pathway KEGG:00030 | 0.0370535592 | 1.431170068964 | 30 98 4 7987 
274054 5500 
KEGG | Carbon metabolism KEGG:01200 | 0.0475077632 | 1.323235416377 | 118 98 7 7987 
5858160 740 
HPA spleen; cells in red pulp[High] | HPA:0530713 | 0.0050472998 | 2.296940890123 | 771 126 23 10596 
94272230 |0200 
НРА salivary gland; glandular HPA:0420053 | 0.0458993433 | 1.338193527484 |1386 | 126 31 10596 
cells[High] 5880240 280 
HPA tonsil; non-germinal center HPA:0600443 | 0.0468033773 | 1.329722807328 | 1260 |126 29 10596 
cells[High] 0768620 210 
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